• DocumentCode
    3523253
  • Title

    Automatic generation of phonetic units for continuous speech recognition

  • Author

    Pieraccini, Roberto ; Rosenberg, Aaron E.

  • Author_Institution
    AT&T Bell Lab., Murray Hill, NJ, USA
  • fYear
    1989
  • fDate
    23-26 May 1989
  • Firstpage
    623
  • Abstract
    Several techniques for dealing with the variability of recognition units are reported. The segmental k-means technique has been used for estimating two-state hidden Markov models for each of an inventory of 46 phones. Multiple models of these phones have been generated using clustering techniques intended to model separately acoustically distinct phenomena associated with the same unit. It is shown that increasing the number of models per phone can significantly increase the performance of a speaker dependent continuous speech recognizer, especially if model weights for phones which reflect their contexts in words found in the test vocabulary can be obtained. The authors feel that model splitting could be even more important in speaker dependent recognition, where it can be used to reduce model variability associated with multiple speakers as well as variable contexts. The results suggest that adequate training data must be available to train multiple models; otherwise, performance degrades when the number of models increases
  • Keywords
    speech recognition; automatic generation; clustering techniques; continuous speech recognition; model weights; multiple models; phones; phonetic units; recognition units; segmental k-means technique; speaker dependent continuous speech recognizer; speaker dependent recognition; test vocabulary; training data; two-state hidden Markov models; Clustering algorithms; Context modeling; Databases; Hidden Markov models; Ice; Peak to average power ratio; Prognostics and health management; Speech recognition; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266504
  • Filename
    266504