The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
CLIP as a service - Embed images and sentences; object recognition, visual reasoning, image classification, and reverse image search
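Reverse image search with CLIP boils down to nearest-neighbor lookup over embedding vectors. Below is a minimal sketch of that retrieval step, assuming the CLIP embeddings have already been computed elsewhere (the random vectors stand in for real image embeddings):

```python
import numpy as np

def cosine_sim(a, b):
    # Cosine similarity between two embedding vectors
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

# Hypothetical pre-computed CLIP embeddings: 5 indexed images (512-dim each)
rng = np.random.default_rng(1)
index = rng.normal(size=(5, 512))

# Query: a near-duplicate of image 2 (its embedding plus slight noise)
query = index[2] + 0.01 * rng.normal(size=512)

# Reverse image search = pick the index entry most similar to the query
best = max(range(len(index)), key=lambda i: cosine_sim(index[i], query))
```

In production the brute-force `max` scan would typically be replaced by an approximate nearest-neighbor index, but the scoring logic is the same.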
Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis, and more
Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.
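KV caching, one of the optimization techniques mentioned above, avoids recomputing attention keys and values for past tokens during autoregressive decoding: each step computes K/V only for the newest token and appends them to a cache. A toy single-head sketch (random vectors stand in for projected hidden states; `KVCache` is an illustrative name, not an API from any particular framework):

```python
import numpy as np

def attention(q, K, V):
    # Scaled dot-product attention for a single query vector
    scores = K @ q / np.sqrt(q.shape[-1])
    w = np.exp(scores - scores.max())
    w /= w.sum()
    return w @ V

class KVCache:
    """Append-only cache of per-token key/value vectors."""
    def __init__(self, d):
        self.K = np.empty((0, d))
        self.V = np.empty((0, d))

    def append(self, k, v):
        self.K = np.vstack([self.K, k])
        self.V = np.vstack([self.V, v])

d = 4
rng = np.random.default_rng(0)
cache = KVCache(d)
outputs = []
for _ in range(3):  # three decode steps
    # Only the new token's k, v, q are computed this step
    k, v, q = rng.normal(size=(3, d))
    cache.append(k, v)
    # Attention runs over all cached tokens, none recomputed
    outputs.append(attention(q, cache.K, cache.V))
```

This turns the per-step attention cost from quadratic recomputation into a single query against the growing cache, which is why KV cache memory, not compute, often becomes the bottleneck when serving LLMs.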