Does enc-dec model support inflight batching? #1573

Oldpan · 2024-05-10T08:40:22Z

I found that trt-llm-dev supports paged KV cache for encoder-decoder models (like Nougat). Do encoder-decoder models work with inflight batching? If not, are there batching methods other than static batching that we can use to improve performance? Thanks

schetlur-nv · 2024-05-16T22:31:50Z

@Oldpan For now we only support static batching for encoder-decoder models, but we plan to add inflight batching support in the coming weeks. Please keep an eye out. Feel free to reopen the issue if you have more questions.

byshiue added the "question" label May 11, 2024

byshiue assigned symphonylyh May 11, 2024

schetlur-nv closed this as completed May 16, 2024
