Skip to content

Quantizing with accuracy control for CNN autoencoder models #2639

Answered by alexsu52
korotaS asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @korotaS,

As practice has shown, model metric is more robust metric to rank quantized operation. If model metric is not available then proxy metric should be used. We usually use MSE as default proxy metric as well. Choose proxy metric is magic and I don't have general recommendation. Based on algorithm working I can share some insides:

  • Validation dataset should include hard cases when quantized model gives bad result.
  • As the first step, the quantize with accuracy control algorithm calculates the impact of each quantized operation on the model metric, by calculating the metric of model in which the target quantized operation is returned to the original precision. Thus If you add print…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@korotaS
Comment options

Answer selected by korotaS
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants