• DocumentCode
    730848
  • Title

    Bidirectional recurrent neural network language models for automatic speech recognition

  • Author

    Arisoy, Ebru ; Sethy, Abhinav ; Ramabhadran, Bhuvana ; Chen, Stanley

  • Author_Institution
    IBM Turkey, Istanbul, Turkey
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5421
  • Lastpage
    5425
  • Abstract
    Recurrent neural network language models have enjoyed great success in speech recognition, partially due to their ability to model longer-distance context than word n-gram models. In recurrent neural networks (RNNs), contextual information from past inputs is modeled with the help of recurrent connections at the hidden layer, while Long Short-Term Memory (LSTM) neural networks are RNNs that contain units that can store values for arbitrary amounts of time. While conventional unidirectional networks predict outputs from only past inputs, one can build bidirectional networks that also condition on future inputs. In this paper, we propose applying bidirectional RNNs and LSTM neural networks to language modeling for speech recognition. We discuss issues that arise when utilizing bidirectional models for speech, and compare unidirectional and bidirectional models on an English Broadcast News transcription task. We find that bidirectional RNNs significantly outperform unidirectional RNNs, but bidirectional LSTMs do not provide any further gain over their unidirectional counterparts.
  • Keywords
    natural language processing; recurrent neural nets; speech recognition; LSTM neural network; RNN; automatic speech recognition; bidirectional recurrent neural network language model; english broadcast news transcription task; long short-term memory neural network; Computational modeling; Logic gates; Mathematical model; Recurrent neural networks; Speech recognition; Training; Language modeling; bidirectional neural networks; long short term memory; recurrent neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7179007
  • Filename
    7179007