• DocumentCode
    3672269
  • Title

    Efficient and accurate approximations of nonlinear convolutional networks

  • Author

    Xiangyu Zhang; Jianhua Zou; Xiang Ming;Kaiming He;Jian Sun

  • Author_Institution
    Xi´an Jiaotong University, China
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    1984
  • Lastpage
    1992
  • Abstract
    This paper aims to accelerate the test-time computation of deep convolutional neural networks (CNNs). Unlike existing methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into account. We minimize the reconstruction error of the nonlinear responses, subject to a low-rank constraint which helps to reduce the complexity of filters. We develop an effective solution to this constrained nonlinear optimization problem. An algorithm is also presented for reducing the accumulated error when multiple layers are approximated. A whole-model speedup ratio of 4× is demonstrated on a large network trained for ImageNet, while the top-5 error rate is only increased by 0.9%. Our accelerated model has a comparably fast speed as the “AlexNet” [11], but is 4.7% more accurate.
  • Keywords
    "Approximation methods","Accuracy","Complexity theory","Computational modeling","Principal component analysis","Matrix decomposition","Acceleration"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2015.7298809
  • Filename
    7298809