Title :
Bridge deep learning to the physical world: An efficient method to quantize network
Author :
Pei-Hen Hung;Chia-Han Lee;Shao-Wen Yang;V. Srinivasa Somayazulu;Yen-Kuang Chen;Shao-Yi Chien
Author_Institution :
Media IC and System Lab, Graduate Institute of Electronics Engineering and Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan
Abstract :
As better performance is achieved by deep convolutional network with more and more layers, the increasing number of weighting and bias parameters makes it only possible to be implemented on servers in cyber space but infeasible to be deployed in physical-world embedded systems because of huge storage and memory bandwidth requirements. In this paper, we proposed an efficient method to quantize the model parameters. Instead of taking the quantization process as a negative effect on precision, we regarded it as a regularize problem to prevent overfitting, and a two-stage quantization technique including soft- and hard-quantization is developed. With the help of our quantization method, not only 93.75% of the parameter memory size can be reduced by replacing the word length from 32-bit to 2-bit, but the testing accuracy after quantization is also better than previous approaches in some dataset, and the additional training overhead is only 3% of the ordinary one.
Keywords :
"Quantization (signal)","Training","Neural networks","Machine learning","Computational modeling","Graphics processing units","Memory management"
Conference_Titel :
Signal Processing Systems (SiPS), 2015 IEEE Workshop on
DOI :
10.1109/SiPS.2015.7345005