Skip to content

Pull requests: intel/intel-extension-for-transformers

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[vLLM] optimizing vLLM models by qbits.
#1548 opened May 15, 2024 by Zhenzhong1 Draft
1 of 4 tasks
Support ipex cpu WOQ backend
#1546 opened May 14, 2024 by changwangss Loading…
lm-eval for llama.cpp enhancement.
#1543 opened May 12, 2024 by lkk12014402 Loading…
catch prepack error and fallback to torch bf16
#1526 opened May 6, 2024 by Spycsh Loading…
Removed fallback for lm_head op WIP
#1482 opened Apr 15, 2024 by PenghuiCheng Loading…
Add text-gen finetune workflow for glue mnli
#1478 opened Apr 12, 2024 by mini-goel Loading…
h2o for kv cache compression WIP
#1468 opened Apr 10, 2024 by n1ck-guo Loading…
1 of 4 tasks
add FP8Config habana
#1442 opened Apr 1, 2024 by mengniwang95 Loading…
add gaudi modeling support in itrex habana
#1438 opened Mar 29, 2024 by ClarkChin08 Loading…
[NeuralChat] RAG evaluation NeuralChat
#1333 opened Mar 1, 2024 by Liangyx2 Loading…
ProTip! Updated in the last three days: updated:>2024-05-12.