Skip to content

Issues: triton-inference-server/tensorrtllm_backend

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

No 24.05-trtllm-python-py3 in NGC Repo
#476 opened May 25, 2024 by avianion
tensorrt_llm_bls disregards temperature setting bug Something isn't working
#472 opened May 23, 2024 by janpetrov
1 of 4 tasks
[Bug] Zero temperature curl request affects non-zero temperature requests bug Something isn't working
#464 opened May 20, 2024 by Hao-YunDeng
2 of 4 tasks
Tritonserver won't start up running Smaug 34b bug Something isn't working
#459 opened May 15, 2024 by workuser12345
2 of 4 tasks
Mixtral 8x7-v0.1 Hangs after serving a few requests bug Something isn't working
#457 opened May 15, 2024 by aaditya-srivathsan
2 of 4 tasks
Example gpu_device_ids for multi-model usage? question Further information is requested
#448 opened May 9, 2024 by vnkc1
2 of 4 tasks
InFlightBatching seems not working need more info triaged Issue has been triaged by maintainers
#442 opened May 6, 2024 by larme
2 of 4 tasks
Deployement failed for BERT triaged Issue has been triaged by maintainers
#440 opened May 3, 2024 by vivekjoshi556
Deploying Mixtral-8x7B-v0.1 with Triton 24.02 on A100 (160GB) raises "Cuda Runtime (out of memory)" exception bug Something isn't working triaged Issue has been triaged by maintainers
#438 opened Apr 29, 2024 by kelkarn
2 of 4 tasks
Encountered an error in forward function: std::bad_cast bug Something isn't working
#435 opened Apr 26, 2024 by wangqy1216
1 of 4 tasks
max_batch_size seems to have no impact on model performance bug Something isn't working triaged Issue has been triaged by maintainers
#429 opened Apr 23, 2024 by VitalyPetrov
3 of 4 tasks
Performance Issue with return_context_logits Enabled in TensorRT-LLM bug Something isn't working
#428 opened Apr 23, 2024 by gywlssww
2 of 4 tasks
Seg fault after loaded models in official example bug Something isn't working
#425 opened Apr 20, 2024 by LeatherDeerAU
2 of 4 tasks
ProTip! Follow long discussions with comments:>50.