• DocumentCode
    3231896
  • Title

    Subphonetic modeling with Markov states-Senone

  • Author

    Hwang, Mei-Yuh ; Huang, Xuedong

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburg, PA, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    33
  • Abstract
    There will never be sufficient training data to model all the various acoustic-phonetic phenomena. How to capture important clues and estimate those needed parameters reliably is one of the central issues in speech recognition. Successful examples include subword models, fenones and many other smoothing techniques. In comparison with subword models, subphonetic modeling may provide a finer level of details. The authors propose to model subphonetic events with Markov states and treat the state in phonetic hidden Markov models as the basic subphonetic unit-senone. Senones generalize fenones in several ways. A word model is a concatenation of senones and senones can be shared across different word models. Senone models not only allow parameter sharing, but also enable pronunciation optimization. The authors report preliminary senone modeling results, which have significantly reduced the word error rate for speaker-independent continuous speech recognition
  • Keywords
    hidden Markov models; speech recognition; HMM; Markov states; acoustic-phonetic phenomena; fenones; hidden Markov models; parameter sharing; pronunciation optimization; senone; speaker-independent continuous speech recognition; speech recognition; subphonetic modeling; subword models; word error rate; word model; Computer science; Context modeling; Error analysis; Hidden Markov models; Iterative algorithms; Parameter estimation; Smoothing methods; Speech recognition; State estimation; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225979
  • Filename
    225979