A universal scalable machine learning model deployment solution
Large Language Model Text Generation Inference
📚 Jupyter notebook tutorials for OpenVINO™
A high-throughput and memory-efficient inference and serving engine for LLMs
I hope this repo gives you inspiration to learn and grow in the world of Statistics. I'm not perfect at everything, so if you have a suggestion I'll gladly accept it. I hope you enjoy what you can find on the page. **This repo is still under construction**
Utilities to use the Hugging Face Hub API
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Faster Whisper transcription with CTranslate2
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
Port of OpenAI's Whisper model in C/C++
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
TypeDB: the polymorphic database powered by types
Python package to perform statistical hypothesis tests.
Making large AI models cheaper, faster, and more accessible
Implement a high-performance deep learning inference library from scratch, step by step, supporting inference for large models such as Llama 2, U-Net, YOLOv5, and ResNet.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference