• DocumentCode
    1934448
  • Title

    A statistical model for generating pronunciation networks

  • Author

    Riley, Michael D.

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    737
  • Abstract
    Methods to predict detailed phonetic pronunciations from a coarse phonemic transcription are described. The phonemic base forms, obtainable from orthographic text by dictionary lookup and other means, do not specify fine phonetic detail such as flapping, glottal stop insertion, or the formation of syllabic nasals and liquids. These phenomena depend on the phonetic context (often spanning word boundaries), stress environment, speaking rate, and dialect. A procedure is presented that builds decision trees, trained on the TIMIT database, using some of these features to predict pronunciation alternatives. The resulting phonetic network predicts the correct pronunciation of a phoneme on test data from the same corpus approximately 83% of the time and the correct phone was in the top five guesses 99% of the time
  • Keywords
    filtering and prediction theory; speech analysis and processing; statistical analysis; trees (mathematics); TIMIT database; coarse phonemic transcription; decision trees; dialect; dictionary lookup; flapping; glottal stop insertion; orthographic text; phonemic base forms; phonetic context; phonetic pronunciation prediction; pronunciation networks; speaking rate; speech analysis; statistical model; stress environment; syllabic liquids; syllabic nasals; Dairy products; Databases; Dictionaries; Microwave integrated circuits; Predictive models; Read only memory; Stress;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1991.150446
  • Filename
    150446