How would one go about running embedding as a service using something like vLLM? #2

sungkim11 · 2024-02-18T05:00:39Z

I would like to run embedding as a service, using something like vLLM, in a Docker container on a different host. How would one go about doing this?

Muennighoff · 2024-02-18T16:43:38Z

I think it should be easy to serve GritLM with vLLM or a similar framework, exposing its embedding capability, its language modeling capability, or both through a single model / endpoint. I'm not sure about the details of vLLM, though.
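To make the "embedding as a service" idea concrete, here is a rough sketch that does not use vLLM at all: it wraps any embedding function behind a small HTTP endpoint using only the Python standard library. The `/embed` route, the `texts` / `embeddings` JSON fields, and the `make_handler` / `serve` helpers are all made up for illustration, and the GritLM call in the comments is an assumption based on the `gritlm` package README, so check it against the version you install.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def make_handler(embed_fn):
    """Build a request handler around embed_fn(list[str]) -> list of vectors.

    embed_fn is a placeholder: plug in whatever model you actually serve.
    """

    class EmbeddingHandler(BaseHTTPRequestHandler):
        def do_POST(self):
            # Hypothetical route, chosen only for this sketch.
            if self.path != "/embed":
                self.send_error(404)
                return
            length = int(self.headers.get("Content-Length", 0))
            try:
                payload = json.loads(self.rfile.read(length))
                texts = payload["texts"]
            except (ValueError, KeyError, TypeError):
                self.send_error(400, "expected JSON body with a 'texts' list")
                return
            # Convert to plain floats so NumPy/torch outputs serialize as JSON.
            vectors = [[float(x) for x in vec] for vec in embed_fn(texts)]
            body = json.dumps({"embeddings": vectors}).encode("utf-8")
            self.send_response(200)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

        def log_message(self, fmt, *args):
            # Silence the default per-request logging.
            pass

    return EmbeddingHandler


def serve(embed_fn, host="0.0.0.0", port=8000):
    """Block forever, answering POST /embed requests."""
    HTTPServer((host, port), make_handler(embed_fn)).serve_forever()


# Assumed usage with GritLM (not run here; verify against the gritlm README):
#   from gritlm import GritLM
#   model = GritLM("GritLM/GritLM-7B", torch_dtype="auto")
#   serve(lambda texts: model.encode(texts, instruction="<|embed|>\n"))
```

Binding to `0.0.0.0` is what would let a client on another host reach the service once the container's port is published (for example with `docker run -p 8000:8000 ...`). This handles one request at a time and does no batching, so it is only a starting point; a serving framework like vLLM would be the place to look for throughput.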
