• DocumentCode
    1912352
  • Title

    Improved acoustic modeling with the SPHINX speech recognition system

  • Author

    Huang, X.D. ; Lee, K.F. ; Hon, H.W. ; Hwang, M.Y.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    345
  • Abstract
    The authors report recent efforts to further improve the performance of the SPHINX system for speaker-independent continuous speech recognition. They adhere to the basic architecture of the SPHINX system and use the DARPA resource management task and training corpus. The improvements are evaluated on the 600 sentences that comprise the DARPA February and October 1989 test sets. Several techniques that substantially reduced SPHINX´s error rate are presented. These techniques include dynamic features, semicontinuous hidden Markov models, speaker clustering, and the shared distribution modeling. The error rate of the baseline system was reduced by 45%
  • Keywords
    speech recognition; DARPA resource management task and training corpus; SPHINX speech recognition system; dynamic features; error rate reduction; improved acoustic modelling; semicontinuous hidden Markov models; shared distribution modeling; speaker clustering; speaker-independent continuous speech recognition; Cepstrum; Computer architecture; Computer science; Error analysis; Hidden Markov models; Linear predictive coding; Loudspeakers; Resource management; Smoothing methods; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1991.150347
  • Filename
    150347