Skip to content
This repository has been archived by the owner on Aug 5, 2022. It is now read-only.

weight quantization #283

Open
minhson opened this issue May 14, 2019 · 2 comments
Open

weight quantization #283

minhson opened this issue May 14, 2019 · 2 comments

Comments

@minhson
Copy link

minhson commented May 14, 2019

Hello!
I am new to Intel Caffe!
As i read Intel document "LOWER NUMERICAL PRECISION DEEP LEARNING INFERENCE AND TRAINING". It said that "quantizing the weights is done before inference starts. Quantizing the activations efficiently requires precomputing the quantization factors".
However, when i use Calibrator tool, i just get the quantized prototxt. I don't know where the weights is quantized.

Could you show me where the weights is quantized?
Thanks you so much!

@hshen14
Copy link
Contributor

hshen14 commented May 31, 2019

You can find "scale_params" in the quantized prototxt

@minhson
Copy link
Author

minhson commented Jun 12, 2019

You can find "scale_params" in the quantized prototxt

yes, i saw it.
do we have to quantize the weight firstly before running inference? or the weight is quantized through reorder primitive?
thanks you!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants