Quantizing deep convolutional networks for efficient inference: A whitepaper


Overview


This whitepaper gives an overview of techniques for quantizing convolutional neural networks for inference with integer weights and activations.

Quantizer Design


Uniform Affine Quantizer
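
As a rough illustration of what a uniform affine quantizer does, the sketch below maps a floating-point value to an unsigned integer using a scale and a zero-point, then maps it back. It assumes the common scale/zero-point formulation (round, shift by the zero-point, clamp to the integer range). The function names `affine_params`, `quantize_affine`, and `dequantize_affine` are hypothetical, and the choice to widen the range so that 0.0 is exactly representable is an assumption made here, not something this outline states.

```python
def affine_params(x_min, x_max, num_bits=8):
    """Derive an assumed (scale, zero_point) pair from a float range."""
    n_levels = 2 ** num_bits
    # Assumption: widen the range to include 0.0 so zero maps to an integer.
    x_min = min(x_min, 0.0)
    x_max = max(x_max, 0.0)
    scale = (x_max - x_min) / (n_levels - 1)
    if scale == 0.0:
        scale = 1.0  # degenerate range; any positive scale works
    zero_point = int(round(-x_min / scale))
    return scale, zero_point


def quantize_affine(x, scale, zero_point, num_bits=8):
    """Round to the nearest step, shift by zero_point, clamp to [0, 2^b - 1]."""
    n_levels = 2 ** num_bits
    # Note: Python's round() rounds exact ties to the nearest even integer.
    x_int = int(round(x / scale)) + zero_point
    return max(0, min(n_levels - 1, x_int))


def dequantize_affine(x_q, scale, zero_point):
    """Map a quantized integer back to an approximate float."""
    return (x_q - zero_point) * scale


if __name__ == "__main__":
    scale, zp = affine_params(-0.5, 1.5)
    q = quantize_affine(0.3, scale, zp)
    print(q, dequantize_affine(q, scale, zp))
```

With this formulation, 0.0 quantizes to the zero-point and dequantizes back to exactly 0.0, and values outside the chosen range saturate at the ends of the integer range.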


Uniform symmetric quantizer
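
A symmetric quantizer can be read as the affine case with the zero-point fixed at 0. The sketch below is a minimal illustration under that assumption. The names `symmetric_scale`, `quantize_symmetric`, and `dequantize_symmetric` are hypothetical, as is the choice to derive the scale from the largest absolute value in the range.

```python
def symmetric_scale(x_abs_max, num_bits=8):
    """Assumed scale choice: map the largest magnitude to the top signed level."""
    q_max = (2 ** num_bits) // 2 - 1
    return x_abs_max / q_max if x_abs_max > 0 else 1.0


def quantize_symmetric(x, scale, num_bits=8, signed=True):
    """Round to the nearest step (zero-point is 0), then clamp."""
    n_levels = 2 ** num_bits
    if signed:
        lo, hi = -(n_levels // 2), n_levels // 2 - 1
    else:
        lo, hi = 0, n_levels - 1
    # Note: Python's round() rounds exact ties to the nearest even integer.
    x_int = int(round(x / scale))
    return max(lo, min(hi, x_int))


def dequantize_symmetric(x_q, scale):
    """Map a quantized integer back to an approximate float."""
    return x_q * scale


if __name__ == "__main__":
    scale = symmetric_scale(1.27)
    q = quantize_symmetric(0.5, scale)
    print(q, dequantize_symmetric(q, scale))
```

Because there is no zero-point to subtract, dequantization reduces to a single multiplication, which is the usual reason given for preferring the symmetric form where the value distribution allows it.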


Stochastic quantizer: We do not consider stochastic quantization for inference.