Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update customAllReduceKernels.cu - line 120's typo was edited
#1558
opened May 8, 2024 by
sjbae1999
Loading…
[fix] export failure with CUDA driver < 526 and pynvml>=11.5.0
#1537
opened May 3, 2024 by
CoderHam
Loading…
Loading Medusa Safetensors + AWQ Conversion correction
#1535
opened May 2, 2024 by
Tushar-ml
Loading…
Define hf_config explisitly for convert_hf_mpt_legacy
#1534
opened May 2, 2024 by
bloodeagle40234
Loading…
fix: correct cudaSetDevice error when GPUs per node are fewer than their ranks in inter-node inference
#1495
opened Apr 24, 2024 by
littlefatfat
Loading…
[feat]: Add Option to convert and run distil-whisper large-v3
#1337
opened Mar 22, 2024 by
IbrahimAmin1
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.