Issue Summary:
Yesterday I ran a test that spun up 30 concurrent goroutines, each sending a POST request to a local Ollama instance. Everything worked smoothly, and all responses arrived within roughly 2 minutes.
```go
for i := 0; i < 30; i++ {
    go ProcessPromptWithOllama()
}
```
Problem Description:
However, after updating Ollama today, I encountered the following errors:
{"error":"server busy, please try again. maximum pending requests exceeded"}
{"error":"unexpected server status: llm busy - no slots available"}
Reproducibility:
The issue is consistently reproducible after the Ollama update. It occurs regardless of the specific endpoint or payload used in the POST requests.
Expected Behavior:
I expected the updated Ollama to handle the concurrent requests as efficiently as it did before the update, without encountering any server overload issues.
OS
macOS
GPU
Apple
CPU
Apple
Ollama version
0.1.33