Pull requests: neuralmagic/nm-vllm

Pull requests list

Upstream sync 2024 04 26
#211 opened Apr 26, 2024 by robertgshaw2-neuralmagic
Add lm-eval correctness test
#210 opened Apr 25, 2024 by dbarbuzzi
2 tasks
update workflows to use generated whls
#204 opened Apr 23, 2024 by andy-neuma
Add test framework for server
#200 opened Apr 22, 2024 by dbarbuzzi Draft
Upstream sync 2024 04 21
#198 opened Apr 22, 2024 by robertgshaw2-neuralmagic
[WIP] FLAN-T5 integration
#194 opened Apr 17, 2024 by afeldman-nm
WIP: basic correctness test
#192 opened Apr 17, 2024 by derekk-nm Draft
whl centric
#191 opened Apr 17, 2024 by andy-neuma
Added Docker Compose Example
#182 opened Apr 12, 2024 by robertgshaw2-neuralmagic
vllm - quantization : DO NOT MERGE
#180 opened Apr 11, 2024 by varun-sundar-rabindranath
Pypi and updates
#177 opened Apr 9, 2024 by andy-neuma
Support for compressed-tensors
#159 opened Apr 2, 2024 by dbogunowicz
[WiP] Whisper Implementation
#147 opened Mar 26, 2024 by dbogunowicz
Prometheus deliverable 1+2
#93 opened Mar 5, 2024 by horheynm
[WIP] afeldman-nm/encoder decoder
#22 opened Feb 16, 2024 by afeldman-nm