int8-inference

Star

Here are 11 public repositories matching this topic...

ENOT-AutoDL / gpt-j-6B-tensorrt-int8

Star

GPT-J 6B inference on TensorRT with INT-8 precision

transformers inference quantization tensorrt int8-inference gpt-j gpt-j-6b enot-autodl

Updated Apr 5, 2023
Python

daniel-rychlewski / cnn-planesnet

Star

Compressed CNNs for airplane classification in satellite images (APoZ-based parameter pruning, INT8 weight quantization)

Updated Jun 10, 2020
Python

ENOT-AutoDL / ENOT-transformers

Star

transformers inference quantization tensorrt int8-inference gpt2 int8-quantization gptj enot-autodl

Updated Jun 8, 2023

yester31 / TensorRT_ONNX

Star

Generating tensorrt model using onnx

pytorch quantization tensorrt onnx int8-inference onnxruntime post-training-quantization int8-quantization tensorrt-inference ptq

Updated Jun 22, 2023
C++

whitelok / tensorrt-int8-python-sample

Star

TensorRT Int8 Python version sample. TensorRT Int8 Python 实现例子。TensorRT Int8 Pythonの例です

python machine-learning ai deep-learning inference nvidia tensorrt int8 int8-inference tensorrt-int8-python

Updated Jan 28, 2019
Python

akashAD98 / yolov7_vino_with_object_tracking

Star

it has support for openvino converted model of yolov7-int.xml ,yolov7x,

onnx deepsort openvino int8-inference yolov7

Updated Mar 6, 2023
Python

Howell-Yang / onnx2trt

Star

将端上模型部署过程中，常见的问题以及解决办法记录并汇总，希望能给其他人带来一点帮助。

python tensorrt calibrator int8-inference int8-quantization

Updated Aug 17, 2022
Python

jahongir7174 / YOLOv8-qat

Star

Quantization Aware Training

python pytorch object-detection int8-inference quantization-aware-training int8-quantization yolov8

Updated Jan 13, 2024
Python

DerryHub / BEVFormer_tensorrt

Star

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

cuda pytorch quantization int8-inference bevformer tensorrt-plugins

Updated Nov 20, 2023
Python

anilsathyan7 / Portrait-Segmentation

Star

Real-time portrait segmentation for mobile devices

Updated Jan 17, 2021
Jupyter Notebook

BUG1989 / caffe-int8-convert-tools

Star

Generate a quantization parameter file for ncnn framework int8 inference

caffe ncnn deeplearning-ai quantized-neural-networks int8-inference

Updated Jul 29, 2020
Python

Improve this page

Add a description, image, and links to the int8-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the int8-inference topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

int8-inference

Here are 11 public repositories matching this topic...

ENOT-AutoDL / gpt-j-6B-tensorrt-int8

daniel-rychlewski / cnn-planesnet

ENOT-AutoDL / ENOT-transformers

yester31 / TensorRT_ONNX

whitelok / tensorrt-int8-python-sample

akashAD98 / yolov7_vino_with_object_tracking

Howell-Yang / onnx2trt

jahongir7174 / YOLOv8-qat

DerryHub / BEVFormer_tensorrt

anilsathyan7 / Portrait-Segmentation

BUG1989 / caffe-int8-convert-tools

Improve this page

Add this topic to your repo