forked from vllm-project/vllm
Pull requests: neuralmagic/nm-vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] Please do not delete - comparing changes between branches
#203
opened Apr 23, 2024 by
afeldman-nm
Loading…
Initial
CompressedTensors
config + Activation Quantization support for static W8A8 per tensor
#195
opened Apr 18, 2024 by
dsikka
Loading…
[WIP][Core] Add Automatic Prefix Caching to BlockSpaceManagerV2
#171
opened Apr 8, 2024 by
SageMoore
Loading…
[WIP] Upstream encoder/decoder support based on multiple blocktables
#161
opened Apr 2, 2024 by
afeldman-nm
•
Draft
[Timings] Add the ability to log times for async and sync calls
#152
opened Mar 27, 2024 by
dsikka
Loading…
ProTip!
Updated in the last three days: updated:>2024-04-25.