I want to compute a convolution with uint8 activations and int8 weights to get an int32 output; quantization and dequantization happen before and after the fbgemmConv call.
My uint8 activations are laid out as NCHW and my int8 weights as (OC, IC, KH, KW).
After taking a closer look at the example code in bench/ConvUnifiedBenchmark.cc, I'm a little confused about the call flow, the input and output data layouts, and the call parameters.
From the context of the code, I guess the layout of Aint8 should be NHWC, the layout of Bint8 is KRSC, i.e., (OC, KH, KW, IC), and the value of Aint8_zero_point should be the activation padding value.
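If that guess is right, I would need to permute both of my buffers before the call. Below is a minimal sketch of what I have in mind; `channels_first_to_last` is my own hypothetical helper, not an FBGEMM API, and it assumes dense, contiguous tensors.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of FBGEMM): permute a dense 4-D tensor stored
// as (d0, d1, d2, d3) into (d0, d2, d3, d1).
//   activations: NCHW             -> NHWC
//   weights:     (OC, IC, KH, KW) -> (OC, KH, KW, IC), i.e. KRSC
template <typename T>
std::vector<T> channels_first_to_last(const std::vector<T>& src,
                                      std::size_t d0, std::size_t d1,
                                      std::size_t d2, std::size_t d3) {
  assert(src.size() == d0 * d1 * d2 * d3);
  std::vector<T> dst(src.size());
  for (std::size_t n = 0; n < d0; ++n) {
    for (std::size_t c = 0; c < d1; ++c) {
      for (std::size_t h = 0; h < d2; ++h) {
        for (std::size_t w = 0; w < d3; ++w) {
          // Source index is channels-first, destination is channels-last.
          dst[((n * d2 + h) * d3 + w) * d1 + c] =
              src[((n * d1 + c) * d2 + h) * d3 + w];
        }
      }
    }
  }
  return dst;
}
```

The same function would serve both tensors, for example `channels_first_to_last(act, N, C, H, W)` for the activations and `channels_first_to_last(wgt, OC, IC, KH, KW)` for the weights. Whether the weights then need further packing before fbgemmConv is part of what I am asking.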
I have a few questions about that example code:
What are the meaning and specific values of C_multiplier, C_zero_point, Bint8_zero_point, and col_offsets, and how should I set them in my conv quantization code when calling fbgemmConv?
What are the specific layouts of the input and output data? Are other layouts supported?
What is the point of calling row_offsets_u8acc32_ref and requantize_u8acc32_ref when quantization and dequantization are done outside?
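For context on the last question, my current understanding of why row and column sums show up at all is the zero-point identity sketched below for a single output element. All names here are made up for illustration, and I have not verified that FBGEMM's row offsets and col_offsets follow exactly this convention.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Reference: subtract the zero points first, then accumulate in int32.
std::int32_t dot_direct(const std::vector<std::uint8_t>& a,
                        const std::vector<std::int8_t>& b,
                        std::int32_t a_zp, std::int32_t b_zp) {
  assert(a.size() == b.size());
  std::int32_t acc = 0;
  for (std::size_t i = 0; i < a.size(); ++i) {
    acc += (static_cast<std::int32_t>(a[i]) - a_zp) *
           (static_cast<std::int32_t>(b[i]) - b_zp);
  }
  return acc;
}

// Same value from the raw uint8 x int8 accumulation plus correction terms:
//   sum((a - a_zp) * (b - b_zp))
//     = sum(a * b) - a_zp * sum(b) - b_zp * sum(a) + K * a_zp * b_zp
// Here sum(a) plays the role of a "row offset" and sum(b) of a "column offset".
std::int32_t dot_via_offsets(const std::vector<std::uint8_t>& a,
                             const std::vector<std::int8_t>& b,
                             std::int32_t a_zp, std::int32_t b_zp) {
  assert(a.size() == b.size());
  std::int32_t raw = 0, row_offset = 0, col_offset = 0;
  for (std::size_t i = 0; i < a.size(); ++i) {
    raw += static_cast<std::int32_t>(a[i]) * static_cast<std::int32_t>(b[i]);
    row_offset += a[i];
    col_offset += b[i];
  }
  const std::int32_t k = static_cast<std::int32_t>(a.size());
  return raw - a_zp * col_offset - b_zp * row_offset + k * a_zp * b_zp;
}
```

If this is the right picture, then with a weight zero point of 0 the row-offset term vanishes, which is why I wonder whether I need row_offsets_u8acc32_ref at all in my case.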
Thanks!