Pull requests: huggingface/text-generation-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
#1940
opened May 23, 2024 by
Narsil
Loading…
5 tasks
Enhance AsyncClient for Flexible Session Management in Text Generation Client
#1784
opened Apr 21, 2024 by
BenHaimItay
Loading…
4 tasks done
ProTip!
Follow long discussions with comments:>50.