• DocumentCode
    284576
  • Title

    Speech recognition using segmental neural nets

  • Author

    Austin, S. ; Zavaliagkos, G. ; Makhoul, J. ; Schwartz, R.

  • Author_Institution
    BNN Syst. & Technol., Cambridge, MA, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    625
  • Abstract
    The authors present the concept of a segmental neural net (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how this can be used with a multiple hypothesis (or N -Best) paradigm to combine different CSR systems. In particular, the authors developed a system that combines the SNN with a hidden Markov model (HMM) system. In a speaker-independent, 1000-word CSR test using a word-pair grammar, the error rate for the hybrid system dropped 25% from that of a state-of-the-art HMM system alone. By taking into account all the frames of a phonetic segment simultaneously, the SNN overcomes the well-known conditional-independence limitation of HMMs. The hybrid SNN/HMM system generates likely phonetic segmentations from the HMM N-best list, which are scored by the SNN. The HMM and SNN scores are then combined to optimize performance
  • Keywords
    error statistics; hidden Markov models; neural nets; speech recognition; CSR systems; continuous speech recognition; error rate; hidden Markov model; hybrid SNN/HMM system; phonetic modeling; segmental neural net; Availability; Computational efficiency; Context modeling; Error analysis; Hidden Markov models; Neural networks; Signal processing; Speech recognition; Sun; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225831
  • Filename
    225831