• DocumentCode
    394343
  • Title

    A phone recognizer helps to recognize words better

  • Author

    Stemmer, Georg ; Zeissler, Viktor ; Hacker, Christian ; Noth, Elmar ; Niemann, Heinrich

  • Author_Institution
    Lehrstuhl fur Mustererkennung (Informatik 5), Erlangen-Nurnberg Univ., Erlangen, Germany
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    For most speech recognition systems dynamic features are the only way to incorporate temporal context into the output distributions of the HMM. In this paper we propose an efficient method to utilize a large context in the recognition process. State scores of a phone recognizer which runs in parallel to the word recognizer are computed. Integrating these scores in the HMM of the word recognizer makes their output densities context-dependent. The approach is evaluated on a set of spontaneous utterances which have been recorded with our spoken dialogue system. A significant reduction of the word error rate has been achieved.
  • Keywords
    error statistics; feature extraction; hidden Markov models; speech processing; speech recognition; HMM; context-dependent output densities; dynamic features; large context; output distributions; phone recognizer; speech recognition; spontaneous utterances; state scores; temporal context; word error rate; word recognizer; Cepstral analysis; Computer hacking; Concurrent computing; Data mining; Error analysis; Feature extraction; Hidden Markov models; Pattern recognition; Random variables; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198886
  • Filename
    1198886