This issue was moved to a discussion. You can continue the conversation there.
[Request] Can the llama model be kept alive during a conversation? #2240
Comments
Thank you for raising an issue. We will investigate the matter and get back to you as soon as possible.
🥰 Description of requirements
I deployed ollama with Docker; the machine is an i-3600k + rtx4090.

🧐 Solution
Is it possible to keep a model alive after starting a conversation? Or are there other options?

📝 Supplementary information
No response
You might have to ask the ollama community about this.
Looking at the Ollama deployment documentation: I have submitted a PR for the corresponding environment variable, which lets the model stay loaded for longer.
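For reference, Ollama exposes a keep-alive setting both as a server environment variable (`OLLAMA_KEEP_ALIVE`) and as a per-request `keep_alive` parameter in its REST API. A minimal sketch for a Docker deployment like the one described above; the `24h` value, container name, and model name are illustrative:

```shell
# Start the Ollama container with loaded models kept in memory for 24 hours
# (the default unload timeout is about 5 minutes; -1 keeps a model loaded indefinitely).
docker run -d --gpus=all \
  -e OLLAMA_KEEP_ALIVE=24h \
  -v ollama:/root/.ollama \
  -p 11434:11434 \
  --name ollama ollama/ollama

# Alternatively, override keep-alive per request via the REST API:
curl http://localhost:11434/api/generate \
  -d '{"model": "llama2", "prompt": "hello", "keep_alive": "24h"}'
```

With either form, the model stays resident in GPU memory between requests, which avoids the slow reload after a pause in the conversation.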
🥰 Description of requirements
I deployed ollama with Docker; the machine is an i-3600k + rtx4090.
When chatting with a llama model through LobeChat, two things need optimizing:
1. Starting a conversation takes a very long time, stuck on "loading" — presumably because running `ollama run llama` is slow.
2. After the conversation pauses for a while, the next message again requires a long wait.

🧐 Solution
Can the model be kept alive after a conversation starts? Or is there another approach?

📝 Supplementary information
No response