Pull requests: FMInference/FlexGen

Pull requests list

Add support for Llama and Qwen models
#135 opened Mar 29, 2024 by marswen
[Feature] Intel dGPU/SYCL support
#125 opened Sep 25, 2023 by abhilash1910
Add support for symmetric quantization
#124 opened Sep 16, 2023 by julian-q
fix torchrun inference
#112 opened Apr 25, 2023 by fsx950223
Allow FlexGen to use locally downloaded models
#111 opened Apr 24, 2023 by Vinkle-hzt
Add SkyPilot example for running benchmarks
#96 opened Mar 9, 2023 by Michaelvll
1 task done
CPU and M1/M2 GPU platform support
#80 opened Mar 1, 2023 by xiezhq-hermann