I figured out that the batcher expects inputs in the form `{"instances": [...]}` and combines requests based on the `instances` list length and the `maxBatchSize` setting.
Let's say my custom transformer model expects inputs like `{"context": "hello", "questions": ["a", "b", "c", ...]}`.
So if 2 requests like below are made:

```python
request1 = {"instances": [{"context": "hello", "questions": ["a", "b", "c", "d"]}]}
request2 = {"instances": [{"context": "bye", "questions": ["e"]}]}
```

they are combined like so (assuming `maxBatchSize=4`):

```python
combined_request = {
    "instances": request1["instances"] + request2["instances"]
}
# which is equal to
combined_request = {
    "instances": [
        {"context": "hello", "questions": ["a", "b", "c", "d"]},
        {"context": "bye", "questions": ["e"]},
    ]
}
```
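To make the combining behavior concrete, here is a minimal runnable sketch of it (my own reconstruction for illustration, not the actual batcher code): requests are merged by concatenating their `instances` lists, and only the instance count is compared against `maxBatchSize`.

```python
# Sketch of the batching behavior described above (illustrative, not
# KServe's actual implementation): concatenate "instances" lists until
# adding another request would exceed max_batch_size instances.
def combine(requests, max_batch_size):
    batches, current = [], {"instances": []}
    for req in requests:
        if current["instances"] and (
            len(current["instances"]) + len(req["instances"]) > max_batch_size
        ):
            batches.append(current)
            current = {"instances": []}
        current["instances"] += req["instances"]
    if current["instances"]:
        batches.append(current)
    return batches

request1 = {"instances": [{"context": "hello", "questions": ["a", "b", "c", "d"]}]}
request2 = {"instances": [{"context": "bye", "questions": ["e"]}]}
# Only 2 instances total, so both requests end up in a single batch
# even though they carry 5 questions between them.
print(combine([request1, request2], max_batch_size=4))
```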
But in my case I want to calculate the batch size of my requests based on the length of the `instances.questions` key, which in the example above is 4 and 1, so they shouldn't be combined into a batch.
I know I can flatten the model input structure into a 1:1 `context:question` mapping, but due to other optimization concerns that isn't useful to me.
So my question is: Is it possible to batch based on another field?
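To illustrate what I mean, here is a sketch of the behavior I'm after: if the batcher accepted a pluggable size function, batch size could be computed from the `questions` field instead of the instance count. All names here (`combine_by_size`, `size_fn`) are made up for illustration; no such hook exists today.

```python
# Hypothetical batcher variant that measures batch size with a custom
# size function instead of len(instances). Illustration only.
def questions_size(instance):
    # Size of one instance = number of questions it carries.
    return len(instance["questions"])

def combine_by_size(requests, max_batch_size, size_fn):
    batches, current, current_size = [], {"instances": []}, 0
    for req in requests:
        req_size = sum(size_fn(i) for i in req["instances"])
        if current["instances"] and current_size + req_size > max_batch_size:
            batches.append(current)
            current, current_size = {"instances": []}, 0
        current["instances"] += req["instances"]
        current_size += req_size
    if current["instances"]:
        batches.append(current)
    return batches

request1 = {"instances": [{"context": "hello", "questions": ["a", "b", "c", "d"]}]}
request2 = {"instances": [{"context": "bye", "questions": ["e"]}]}
# request1 alone already reaches max_batch_size=4 questions, so
# request2 is placed in its own batch instead of being merged.
print(combine_by_size([request1, request2], 4, questions_size))
```

With this measure the two example requests stay in separate batches, which is exactly the behavior the instance-count-based batcher cannot express.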
I figure it isn't currently possible, because the batcher implementation seems pretty hardcoded to the expected input/output structure. I assume adding some flexibility to how the batcher behaves through settings could also allow the batcher to be compatible with v2 protocol requests (#2275) too.