
How to perform int8 quantisation (not uint8) using ONNX? #1610

Open
paul-ang opened this issue Feb 16, 2024 · 1 comment

paul-ang commented Feb 16, 2024

Hi team, I am having an issue quantizing a network consisting of Conv and Linear layers with int8 weights and activations in ONNX. I have tried setting this via op_type_dict, but it doesn't work: the activations still use uint8. I am using neural-compressor version 2.3.1.
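For reference, a minimal sketch of the kind of configuration described above, using the neural-compressor 2.x API; the model path and `calib_dataloader` are placeholders, not real artifacts:

```python
from neural_compressor import PostTrainingQuantConfig, quantization

# Request int8 (S8) for both weights and activations on Conv and MatMul
# (Linear layers typically export to MatMul/Gemm in ONNX).
op_type_dict = {
    "Conv":   {"weight": {"dtype": ["int8"]}, "activation": {"dtype": ["int8"]}},
    "MatMul": {"weight": {"dtype": ["int8"]}, "activation": {"dtype": ["int8"]}},
}

config = PostTrainingQuantConfig(approach="static", op_type_dict=op_type_dict)

# "model.onnx" and calib_dataloader are placeholders for the user's model
# and calibration data. As reported in this issue, the resulting
# activations still end up quantized to uint8 rather than int8.
q_model = quantization.fit("model.onnx", config, calib_dataloader=calib_dataloader)
q_model.save("model_int8.onnx")
```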

mengniwang95 (Collaborator) commented

Hi @paul-ang, we only support U8S8 by default because, on x86-64 machines with the AVX2 and AVX512 extensions, ONNX Runtime uses the VPMADDUBSW instruction for U8S8 for performance. Sorry, for now you need to update the code yourself to use S8S8: add 'int8' to the activations' dtype list in https://github.com/intel/neural-compressor/blob/master/neural_compressor/adaptor/onnxrt.yaml.
We will enhance this in our 3.0 API.
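For illustration, the edit is along these lines; this is a hedged sketch of the relevant onnxrt.yaml entry, and the exact keys and surrounding structure may differ between versions:

```yaml
# neural_compressor/adaptor/onnxrt.yaml (illustrative excerpt only;
# the actual file structure may differ by version)
'Conv':
  'weight':
    'dtype': ['int8']
  'activation':
    'dtype': ['uint8', 'int8']   # add 'int8' to allow S8S8 quantization
```

With 'int8' in the activation dtype list, S8S8 becomes a candidate the tuner can select; note the caveat above that U8S8 is the default precisely because it maps onto the faster VPMADDUBSW path on AVX2/AVX512 hardware.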
