Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

ROCm fixes
#2033 opened Jun 6, 2024 by fxmarty Loading…
feat: re-allocate pages dynamically
#2024 opened Jun 5, 2024 by OlivierDehaene Loading…
Internal runner ?
#2023 opened Jun 5, 2024 by Narsil Loading…
5 tasks
Allow to set manually paged attention num_blocks
#2016 opened Jun 5, 2024 by fxmarty Loading…
Xpu gqa
#2013 opened Jun 5, 2024 by sywangyi Loading…
5 tasks
Parallel AMD build.
#2011 opened Jun 4, 2024 by Narsil Loading…
5 tasks
Lora internal
#2010 opened Jun 4, 2024 by drbh Draft
4 tasks
Split build workflow for multiple plateforms
#2005 opened Jun 4, 2024 by fxmarty Loading…
Testing XPU ci.
#1995 opened Jun 3, 2024 by Narsil Loading…
5 tasks
server: use chunked inputs
#1985 opened May 31, 2024 by danieldk Loading…
1 of 5 tasks
feat: add precompile kernels workflow
#1971 opened May 29, 2024 by drbh Draft
implement Open Inference Protocol endpoints
#1942 opened May 23, 2024 by drbh Loading…
Cpu tgi
#1936 opened May 23, 2024 by sywangyi Loading…
5 tasks
add ascend npu support for TGI Stale
#1740 opened Apr 14, 2024 by statelesshz Draft
5 tasks
ProTip! Filter pull requests by the default branch with base:main.