Pull requests: intel/xFasterTransformer
[API] Add LLaMA decoder API.
interface
related to interface
#386
opened May 11, 2024 by
changqi1
[MOCK PR] Continuous Batching Check
continuous batching
continuous batching
enhancement
New feature or request
#357
opened Apr 29, 2024 by
pujiang2018
Draft
[Eval] Add eval test with opencompass.
benchmark
performance or accuracy benchmark
enhancement
New feature or request
Update AWQ GPTQ quantization guide
documentation
Improvements or additions to documentation
#306
opened Apr 10, 2024 by
miaojinc