weight quantization #283

minhson · 2019-05-14T01:15:28Z

Hello!
I am new to Intel Caffe!
As i read Intel document "LOWER NUMERICAL PRECISION DEEP LEARNING INFERENCE AND TRAINING". It said that "quantizing the weights is done before inference starts. Quantizing the activations efficiently requires precomputing the quantization factors".
However, when i use Calibrator tool, i just get the quantized prototxt. I don't know where the weights is quantized.

Could you show me where the weights is quantized?
Thanks you so much!

hshen14 · 2019-05-31T09:05:29Z

You can find "scale_params" in the quantized prototxt

minhson · 2019-06-12T00:46:58Z

You can find "scale_params" in the quantized prototxt

yes, i saw it.
do we have to quantize the weight firstly before running inference? or the weight is quantized through reorder primitive?
thanks you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

weight quantization #283

weight quantization #283

minhson commented May 14, 2019

hshen14 commented May 31, 2019

minhson commented Jun 12, 2019

weight quantization #283

weight quantization #283

Comments

minhson commented May 14, 2019

hshen14 commented May 31, 2019

minhson commented Jun 12, 2019