Why do the weights need to be dequantized after one quantization? #23

lotusdaddy · 2022-07-12T07:42:18Z

   new_quant_x = linear_quantize(x, scale, zero_point, inplace=False)
    n = 2**(k - 1)
    new_quant_x = torch.clamp(new_quant_x, -n, n - 1)
    quant_x = linear_dequantize(new_quant_x,
                                scale,
                                zero_point,
                                inplace=False)

Doesn't this get the weight of the floating point number?

The text was updated successfully, but these errors were encountered:

Minato-Zackie · 2022-09-26T03:40:30Z

From my point of view, most of the quantization papers, the code is using fake quantization operation to simulate quantization. So it's still using floating-point numbers for quantization.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why do the weights need to be dequantized after one quantization? #23

Why do the weights need to be dequantized after one quantization? #23

lotusdaddy commented Jul 12, 2022

Minato-Zackie commented Sep 26, 2022

Why do the weights need to be dequantized after one quantization? #23

Why do the weights need to be dequantized after one quantization? #23

Comments

lotusdaddy commented Jul 12, 2022

Minato-Zackie commented Sep 26, 2022