Vincent Gripon's Website

Blog about my research and teaching

BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization

M. Nikolić, G. B. Hacene, C. Bannon, A. D. Lascorz, M. Courbariaux, Y. Bengio, V. Gripon and A. Moshovos, "BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization," in ArXiv Preprint, 2020.

Neural networks have demonstrably achieved state-of-the-art accuracy using low-bitlength integer quantization, yielding both execution time and energy benefits on existing hardware designs that support short bitlengths. However, the question of finding the minimum bitlength for a desired accuracy remains open. We introduce a training method for minimizing inference bitlength at any granularity while maintaining accuracy. Namely, we propose a regularizer that penalizes large bitlength representations throughout the architecture and show how it can be modified to minimize other quantifiable criteria, such as number of operations or memory footprint. We demonstrate that our method learns thrifty representations while maintaining accuracy. On ImageNet, the method produces an average per-layer bitlength of 4.13, 3.76 and 4.36 bits on AlexNet, ResNet18 and MobileNet V2 respectively, remaining within 2.0%, 0.5% and 0.5% of the base TOP-1 accuracy.
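To make the idea of a bitlength regularizer concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names (`quantize`, `quantize_fractional`, `bitlength_penalty`), the min-max uniform quantizer, the linear interpolation between neighbouring integer bitlengths, and the per-layer `weights` are all assumptions chosen for illustration. The penalty is simply a weighted sum of bitlengths that would be added to the task loss; changing the weights (for example to layer sizes or operation counts) is one way such a term could target memory footprint or compute instead.

```python
import math


def quantize(values, bits):
    """Uniform min-max quantization of a list of floats to `bits` bits.

    Assumed quantizer for illustration; the paper may use a different scheme.
    """
    levels = 2 ** bits - 1          # number of steps between min and max
    lo, hi = min(values), max(values)
    if levels == 0 or hi == lo:     # 0 bits, or a constant tensor
        return [lo for _ in values]
    step = (hi - lo) / levels
    return [lo + round((v - lo) / step) * step for v in values]


def quantize_fractional(values, bits):
    """Quantize with a non-integer bitlength.

    Assumption: blend the results of the two nearest integer bitlengths,
    so the output varies smoothly as `bits` is adjusted during training.
    """
    low = math.floor(bits)
    frac = bits - low
    q_low = quantize(values, low)
    if frac == 0:
        return q_low
    q_high = quantize(values, low + 1)
    return [(1 - frac) * a + frac * b for a, b in zip(q_low, q_high)]


def bitlength_penalty(bitlengths, weights=None, strength=1.0):
    """Regularization term: strength * sum_i weights[i] * bitlengths[i].

    With uniform weights this penalizes the total bitlength; hypothetical
    weights proportional to layer size or operation count would instead
    penalize memory footprint or compute.
    """
    if weights is None:
        weights = [1.0] * len(bitlengths)
    return strength * sum(w * b for w, b in zip(weights, bitlengths))


# Usage: the total objective would be task_loss + bitlength_penalty(...).
layer_bits = [4.0, 3.5, 5.0]        # hypothetical per-layer bitlengths
penalty = bitlength_penalty(layer_bits, strength=0.1)
```

In an actual training setup the bitlengths would be learnable parameters updated by gradient descent alongside the network weights; the sketch only shows how the quantizer and the penalty fit together.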

Download the manuscript.

BibTeX
@inproceedings{NikHacBanLasCouBenGriMos2020,
  author = {Miloš Nikolić and Ghouthi Boukli Hacene
and Ciaran Bannon and Alberto Delmas Lascorz and
Matthieu Courbariaux and Yoshua Bengio and Vincent
Gripon and Andreas Moshovos},
  title = {BitPruning: Learning Bitlengths for
Aggressive and Accurate Quantization},
  booktitle = {ArXiv Preprint},
  year = {2020},
}



