
A PTQ tflite model fails to pass benchmark test #95

Open
liamsun2019 opened this issue Jul 12, 2022 · 6 comments
Labels
bug (Something isn't working) · work/x-small (work that can be done within 3 hours)

Comments

@liamsun2019

My use case:
I apply post-training quantization to a .pth model and then convert it to tflite. The generated tflite model fails the benchmark test with the following error message:
STARTING!
Log parameter values verbosely: [0]
Graph: [out/ptq_model.tflite]
Loaded model out/ptq_model.tflite
ERROR: tensorflow/lite/kernels/concatenation.cc:179 t->params.scale != output->params.scale (3 != -657359264)
ERROR: Node number 154 (CONCATENATION) failed to prepare.
Failed to allocate tensors!
Benchmarking failed.

Please refer to the attachment. Thanks.
test.zip
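
The same failure can be reproduced without the benchmark binary, since allocate_tensors() runs the same kernel Prepare step shown in the log. A minimal sketch using the stock TF Lite Python API:

import tensorflow as tf

# allocate_tensors() runs each kernel's Prepare step, so it hits the same
# CONCATENATION scale check that the benchmark tool reports above.
interpreter = tf.lite.Interpreter(model_path='out/ptq_model.tflite')
interpreter.allocate_tensors()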

@liamsun2019 (Author)

My quantization strategy:
quantizer = PostQuantizer(model, dummy_input, work_dir='out', config={'force_overwrite': True, 'rewrite_graph': True, 'is_input_quantized': None, 'asymmetric': False, 'per_tensor': False})
...
converter = TFLiteConverter(ptq_model, dummy_input, tflite_path='out/ptq_model.tflite', strict_symmetric_check=False, quantize_target_type='int8')
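
For context, a minimal sketch of the full flow around these two lines, based on TinyNeuralNetwork's PTQ example; `model`, `dummy_input`, and `calib_loader` are placeholders (the calibration loop is the part elided above), and the convert step uses stock PyTorch PTQ:

import torch
from tinynn.graph.quantization.quantizer import PostQuantizer
from tinynn.converter import TFLiteConverter

quantizer = PostQuantizer(model, dummy_input, work_dir='out',
                          config={'force_overwrite': True, 'rewrite_graph': True,
                                  'is_input_quantized': None,
                                  'asymmetric': False, 'per_tensor': False})
ptq_model = quantizer.quantize()

# Calibration: feed representative data so the observers record ranges.
with torch.no_grad():
    for data in calib_loader:
        ptq_model(data)

# Convert the observed model into a real quantized model, then export.
ptq_model = torch.quantization.convert(ptq_model.eval())
converter = TFLiteConverter(ptq_model, dummy_input,
                            tflite_path='out/ptq_model.tflite',
                            strict_symmetric_check=False,
                            quantize_target_type='int8')
converter.convert()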

@liamsun2019 (Author)

The following strategy works:
quantizer = PostQuantizer(model, dummy_input, work_dir='out', config={'force_overwrite': True, 'rewrite_graph': True, 'is_input_quantized': None, 'asymmetric': True, 'per_tensor': True})
...
converter = TFLiteConverter(ptq_model, dummy_input, tflite_path='out/ptq_model.tflite', strict_symmetric_check=False, quantize_target_type='uint8')

@liamsun2019
Copy link
Author

It looks like int8 per-channel quantization may trigger this error.

peterjc123 added the bug label on Jul 13, 2022
@peterjc123 (Collaborator)

The following pattern in your model is the root cause of the problem.

A = sigmoid(X)
B = cat(A, Y)

The output tensor of the sigmoid op has fixed quantization parameters. There are several ways to fix this.

  1. Unify the quantization parameters of (Y, B) with those of A, and also disable the observers on those tensors.
  2. Insert a requantization after A, so that we have

A = sigmoid(X)
A_ = requantize(A)
B = cat(A_, Y)

Then we can unify the quantization parameters of (A_, Y, B), just as we usually do.
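
To make option 2 concrete, a minimal sketch of the requantization idea in plain PyTorch; the tensors and the qparams of Y below are made-up illustrative values:

import torch

# A = sigmoid(X): for int8, the output qparams are fixed by sigmoid's
# (0, 1) output range, e.g. scale = 1/256, zero_point = -128 in TFLite.
a = torch.quantize_per_tensor(torch.sigmoid(torch.randn(8)),
                              scale=1 / 256, zero_point=-128, dtype=torch.qint8)
# Y carries whatever qparams its observer recorded (illustrative values):
y = torch.quantize_per_tensor(torch.randn(8),
                              scale=0.05, zero_point=10, dtype=torch.qint8)

# A_ = requantize(A): re-express A with the shared qparams so that every
# input of cat (and its output) agrees, which is exactly what TFLite's
# CONCATENATION kernel checks at prepare time.
a_ = torch.quantize_per_tensor(a.dequantize(),
                               scale=y.q_scale(), zero_point=y.q_zero_point(),
                               dtype=torch.qint8)
b = torch.cat([a_, y])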

@peterjc123 (Collaborator)

Or you may just skip the quantization for this kind of pattern, which seems to be the simplest solution.

@peterjc123 (Collaborator)

  1. Unify the quantization parameters of (Y, B) with those of A, and also disable the observers on those tensors.

This is simpler, I guess. We will try to fix it this way.

peterjc123 added the work/x-small label on Sep 20, 2023