enableBlockReuse
option is not available for tensorrt_llm.runtime.ModelRunner
#1594
Open
2 of 4 tasks
Labels
bug
Something isn't working
System Info
Nothing to do with hardware.
Who can help?
@kaiyux or @ncomly-nvidia
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
nothing to reproduce
Expected behavior
we want to enable common prefix caching, which seems this keyword
enableBlockReuse
is doing that for other runtime.actual behavior
we are able to specify
enableBlockReuse
when usingtensorrt_llm.runtime.ModelRunner
additional notes
no
The text was updated successfully, but these errors were encountered: