Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q2 2024
#3861 opened Apr 4, 2024 by simon-mo
Open 28
v0.4.3 Release Tracker
#4895 opened May 18, 2024 by simon-mo
Open 12

Issues list

[New Model]: IBM Granite Code Models new model Requests to new models
#5095 opened May 29, 2024 by Semihal
[Bug]: Can't run vllm distributed inference with vLLM + Ray bug Something isn't working
#5094 opened May 29, 2024 by linchen111
[Bug]: Gemma model fails with GPTQ marlin bug Something isn't working
#5088 opened May 28, 2024 by arunpatala
[Bug]: The vllm is disconnected after running for some time bug Something isn't working
#5084 opened May 28, 2024 by zxcdsa45687
[Performance]: A few performance-related questions. performance Performance-related issues
#5072 opened May 27, 2024 by maxin9966
[Bug]: Build/Install Issues with pip install -e . bug Something isn't working
#5071 opened May 27, 2024 by Msiavashi
[Bug]: When load model weights, there are infinite loading bug Something isn't working
#5062 opened May 27, 2024 by tjrlwjd1
Running Vllm on ray cluster, logging stuck at loading bug Something isn't working
#5052 opened May 25, 2024 by maherr13
[Installation]: installation Installation problems
#5048 opened May 25, 2024 by Kastycupra
[Bug]: vLLM reports an error at runtime with the latest NVIDIA driver 555.85 bug Something isn't working
#5035 opened May 24, 2024 by gaye746560359