Centered Symmetric Quantization for Hardware-Efficient Low-Bit Neural Networks


Faaiz Asim (Ulsan National Institute of Sience and Technology (UNIST)),* Jaewoo Park (UNIST), Azat Azamat (UNIST), Jongeun Lee (UNIST)
The 33rd British Machine Vision Conference

Abstract

In extremely low-precision quantization, unequal numbers of positive and negative quantization levels seem quite sub-optimal when dealing with symmetrical data distribution such as weight parameters of a neural network. In this paper, based on an observation that a significant amount of quantization error can be caused by a quantizer with unequal vs. equal numbers of quantization levels, we propose a quantizer that has perfectly zero-centered quantization levels for weight quantization, dubbed Centered Symmetric Quantization (CSQ), with an analysis and empirical quantification of why and how much performance gain CSQ can provide over conventional linear quantization (CLQ). Moreover, noting that it is tricky to implement n-bit CSQ using just n-bit arithmetic hardware, we also propose efficient methods of implementing CSQ using (i) standard multiplication hardware and (ii) bit-wise binarized neural-net hardware. Our experimental results using state-of-the-art quantization-aware training methods on ResNets and MobileNet-v2 show that using CSQ for weight in place of CLQ does offer significant performance advantage at extremely low-bit precision (2~3 bits) without any considerable overhead.

Video



Citation

@inproceedings{Asim_2022_BMVC,
author    = {Faaiz Asim and Jaewoo Park and Azat Azamat and Jongeun Lee},
title     = {Centered Symmetric Quantization for Hardware-Efficient Low-Bit Neural Networks},
booktitle = {33rd British Machine Vision Conference 2022, {BMVC} 2022, London, UK, November 21-24, 2022},
publisher = {{BMVA} Press},
year      = {2022},
url       = {https://bmvc2022.mpi-inf.mpg.de/0538.pdf}
}


Copyright © 2022 The British Machine Vision Association and Society for Pattern Recognition
The British Machine Vision Conference is organised by The British Machine Vision Association and Society for Pattern Recognition. The Association is a Company limited by guarantee, No.2543446, and a non-profit-making body, registered in England and Wales as Charity No.1002307 (Registered Office: Dept. of Computer Science, Durham University, South Road, Durham, DH1 3LE, UK).

Imprint | Data Protection