Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Updated May 23, 2024 - Python
Open deep learning compiler stack for CPU, GPU and specialized accelerators
Bringing large language models and chat to web browsers. Everything runs inside the browser with no server support.
Bringing Stable Diffusion models to web browsers. Everything runs inside the browser with no server support.
TVM documentation in Simplified Chinese
AutoKernel is a simple, easy-to-use, low-barrier automatic operator optimization tool that improves the deployment efficiency of deep learning algorithms.
yolort is a runtime stack for YOLOv5 on specialized accelerators such as TensorRT, LibTorch, ONNX Runtime, TVM and ncnn.
FlashInfer: Kernel Library for LLM Serving
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
Open, Modular, Deep Learning Accelerator
TON Foundation invites talent to imagine and realize projects that have the potential to integrate with the daily lives of users.
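Understands engineering deployment better than algorithm researchers do, and understands algorithms and models better than engineers do.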
Optimizing Mobile Deep Learning on ARM GPU with TVM
Solidity compiler for TVM
A home for the final text of all TVM RFCs.