• DocumentCode
    290124
  • Title

    IPA: improved phone modelling with recurrent neural networks

  • Author

    Robinson, Tony ; Hochberg, Mike ; Renals, Steve

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    i
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    This paper describes phone modelling improvements to the hybrid connectionist-hidden Markov model speech recognition system developed at Cambridge University. These improvements are applied to phone recognition from the TIMIT task and word recognition from the Wall Street Journal (WSJ) task. A recurrent net is used to map acoustic vectors to posterior probabilities of phone classes. The maximum likelihood phone or word string is then extracted using Markov models. The paper describes three improvements: connectionist model merging; explicit presentation of acoustic context; and improved duration modelling. The first is shown to provide a significant improvement in the TIMIT phone recognition rate and all three provide an improvement in the WSJ word recognition rate
  • Keywords
    hidden Markov models; maximum likelihood estimation; probability; recurrent neural nets; speech recognition; Cambridge University; Markov models; TIMIT task; Wall Street Journal; acoustic context presentation; acoustic vectors; connectionist model merging; duration modelling; hidden Markov model; maximum likelihood method; phone modelling; phone recognition rate; probabilities; recurrent neural networks; speech recognition system; word recognition rate; Acoustic applications; Context modeling; Hidden Markov models; Maximum likelihood decoding; Maximum likelihood estimation; Merging; Recurrent neural networks; Speech recognition; Testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389361
  • Filename
    389361