TY - JOUR
T1 - Training high-performance and large-scale deep neural networks with full 8-bit integers
AU - Yang, Yukuan
AU - Deng, Lei
AU - Wu, Shuang
AU - Yan, Tianyi
AU - Xie, Yuan
AU - Li, Guoqi
N1 - Publisher Copyright:
© 2020 Elsevier Ltd
PY - 2020/5
Y1 - 2020/5
N2 - Deep neural network (DNN) quantization, which converts floating-point (FP) data in the network to integers (INT), is an effective way to shrink the model size for memory saving and to simplify the operations for compute acceleration. Recently, research on DNN quantization has developed from inference to training, laying a foundation for online training on accelerators. However, existing schemes that leave batch normalization (BN) untouched during training are mostly incomplete quantizations that still adopt high-precision FP in some parts of the data paths. Currently, there is no solution that can use only low-bit-width INT data during the whole training process of large-scale DNNs with acceptable accuracy. In this work, by decomposing all the computation steps in DNNs and fusing three special quantization functions to satisfy the different precision requirements, we propose a unified, complete quantization framework termed “WAGEUBN” to quantize DNNs across all data paths, including W (Weights), A (Activation), G (Gradient), E (Error), U (Update), and BN. Moreover, the Momentum optimizer is also quantized to realize a completely quantized framework. Experiments on ResNet18/34/50 models demonstrate that WAGEUBN can achieve competitive accuracy on the ImageNet dataset. For the first time, the study of quantization in large-scale DNNs is advanced to the full 8-bit INT level. In this way, all operations in training and inference can be bit-wise operations, pushing towards faster processing speed, decreased memory cost, and higher energy efficiency. Our thorough quantization framework has great potential for future efficient portable devices with online learning ability.
AB - Deep neural network (DNN) quantization, which converts floating-point (FP) data in the network to integers (INT), is an effective way to shrink the model size for memory saving and to simplify the operations for compute acceleration. Recently, research on DNN quantization has developed from inference to training, laying a foundation for online training on accelerators. However, existing schemes that leave batch normalization (BN) untouched during training are mostly incomplete quantizations that still adopt high-precision FP in some parts of the data paths. Currently, there is no solution that can use only low-bit-width INT data during the whole training process of large-scale DNNs with acceptable accuracy. In this work, by decomposing all the computation steps in DNNs and fusing three special quantization functions to satisfy the different precision requirements, we propose a unified, complete quantization framework termed “WAGEUBN” to quantize DNNs across all data paths, including W (Weights), A (Activation), G (Gradient), E (Error), U (Update), and BN. Moreover, the Momentum optimizer is also quantized to realize a completely quantized framework. Experiments on ResNet18/34/50 models demonstrate that WAGEUBN can achieve competitive accuracy on the ImageNet dataset. For the first time, the study of quantization in large-scale DNNs is advanced to the full 8-bit INT level. In this way, all operations in training and inference can be bit-wise operations, pushing towards faster processing speed, decreased memory cost, and higher energy efficiency. Our thorough quantization framework has great potential for future efficient portable devices with online learning ability.
KW - 8-bit training
KW - Full quantization
KW - Neural network quantization
KW - Online learning device
UR - http://www.scopus.com/inward/record.url?scp=85079516063&partnerID=8YFLogxK
U2 - 10.1016/j.neunet.2019.12.027
DO - 10.1016/j.neunet.2019.12.027
M3 - Article
C2 - 32070857
AN - SCOPUS:85079516063
SN - 0893-6080
VL - 125
SP - 70
EP - 82
JO - Neural Networks
JF - Neural Networks
ER -
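
Editor's note: the abstract above describes full 8-bit INT quantization of the W/A/G/E/U/BN data paths. The following is a minimal, generic sketch of symmetric per-tensor 8-bit quantization for orientation only; it is not the paper's WAGEUBN quantization functions, and the function names and scale rule are assumptions introduced here for illustration.

import numpy as np

def quantize_int8(x: np.ndarray):
    """Symmetric per-tensor 8-bit quantization (illustrative sketch,
    not the WAGEUBN scheme from the paper)."""
    # Map the largest magnitude onto the signed 8-bit range [-127, 127].
    max_abs = float(np.max(np.abs(x)))
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer and clip to the INT8 range.
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover a floating-point approximation of the original tensor."""
    return q.astype(np.float32) * scale

# Usage: quantize a small weight tensor and inspect the round-trip error.
w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize_int8(q, s)
print("max abs error:", np.max(np.abs(w - w_hat)))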