Skip to content

Pull requests: intel/xFasterTransformer

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Kernel] Add dynamic onednn matmul. performance performance related.
#425 opened May 28, 2024 by changqi1 Draft
[Distribute] Add distribute support for continuous batching api. continuous batching continuous batching enhancement New feature or request
#421 opened May 24, 2024 by Duyi-Wang Loading…
[Kernel] Add GPU kernels. enhancement New feature or request gpu Related to GPU
#372 opened May 7, 2024 by changqi1 Loading…
[Model] Achieve whole pipeline parallel. enhancement New feature or request gpu Related to GPU
#355 opened Apr 28, 2024 by changqi1 Draft
[Eval] Add eval test with opencompass. benchmark performance or accuracy benchmark enhancement New feature or request
#325 opened Apr 17, 2024 by marvin-Yu Draft
Update AWQ GPTQ quantization guide documentation Improvements or additions to documentation
#306 opened Apr 10, 2024 by miaojinc Loading…
[Kernel] Add oneDNN GPU kernels. gpu Related to GPU performance performance related.
#253 opened Feb 29, 2024 by changqi1 Draft
[Kernel] Add oneDNN GPU kernels. gpu Related to GPU performance performance related.
#236 opened Feb 21, 2024 by changqi1 Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.