enableBlockReuse option is not available for tensorrt_llm.runtime.ModelRunner #1594

Open · 2 of 4 tasks · yupbank opened this issue May 13, 2024 · 2 comments
Labels: bug (Something isn't working)

yupbank commented May 13, 2024

System Info

Nothing to do with hardware.

Who can help?

@kaiyux or @ncomly-nvidia

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Nothing to reproduce.

Expected behavior

We want to enable common prefix caching; the enableBlockReuse keyword appears to provide this for other runtimes, but it is not exposed here.
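
For context, a minimal sketch of how block reuse can be turned on through the executor bindings, where the option is already exposed. The module path, class names, and the enable_block_reuse field below are assumptions based on the executor API, not on ModelRunner:

```python
# Sketch only: enabling KV-cache block reuse via the executor bindings.
# These names (tensorrt_llm.bindings.executor, KvCacheConfig, ExecutorConfig,
# enable_block_reuse) are assumptions from the executor API, not from
# tensorrt_llm.runtime.ModelRunner, which does not expose this option.
import tensorrt_llm.bindings.executor as trtllm

kv_cache_config = trtllm.KvCacheConfig(enable_block_reuse=True)
executor_config = trtllm.ExecutorConfig(kv_cache_config=kv_cache_config)
executor = trtllm.Executor(
    "/path/to/engine_dir",           # engine built with trtllm-build
    trtllm.ModelType.DECODER_ONLY,
    executor_config,
)
```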

Actual behavior

We are not able to specify enableBlockReuse when using tensorrt_llm.runtime.ModelRunner.

Additional notes

None.

yupbank added the bug label on May 13, 2024
yupbank commented May 14, 2024

For example, in https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/multimodal/run.py#L136 there is no way to enable enableBlockReuse.
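
A hypothetical illustration of the interface this issue asks for; the kv_cache_enable_block_reuse keyword below does not exist on ModelRunner and only shows the desired usage:

```python
from tensorrt_llm.runtime import ModelRunner

# Desired (hypothetical) interface: ModelRunner.from_dir currently accepts
# no block-reuse option, which is exactly what this issue requests.
runner = ModelRunner.from_dir(
    engine_dir="/path/to/engine_dir",
    rank=0,
    kv_cache_enable_block_reuse=True,  # hypothetical keyword, not yet supported
)
```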

dcampora (Collaborator) commented

Thanks for the report @yupbank . We're on it and it will be fixed in the next release.
