• DocumentCode
    180402
  • Title

    Fine context, low-rank, softplus deep neural networks for mobile speech recognition

  • Author

    Senior, Alan ; Xin Lei

  • Author_Institution
    Google Inc., Mountain View, CA, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    7644
  • Lastpage
    7648
  • Abstract
    We investigate the use of large state inventories and the softplus nonlinearity for on-device neural network based mobile speech recognition. Large state inventories are achieved by less aggressive context-dependent state tying, and made possible by using a bottleneck layer to contain the number of parameters. We investigate alternative approaches to the bottleneck layer, demonstrate the superiority of the softplus non-linearity and investigate alternatives for the final stages of the training algorithm. Overall we reduce the word error rate of the system by 9% relative. The techniques are also shown to work well for large acoustic models for cloud-based speech recognition.
  • Keywords
    mobile computing; neural nets; speech recognition; acoustic models; bottleneck layer; cloud based speech recognition; fine context; large state inventories; low-rank; mobile speech recognition; softplus deep neural networks; softplus nonlinearity; training algorithm; word error rate; Acoustics; Approximation methods; Error analysis; Neural networks; Speech; Speech recognition; Training; Deep neural networks; Voice Search; embedded recognizer; hybrid neural network speech recognition; low-rank approximation; mobile speech recognition; singular value decomposition; softplus nonlinearity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6855087
  • Filename
    6855087