• DocumentCode
    2403690
  • Title

    Automatic pronunciation scoring for language instruction

  • Author

    Franco, Horacio ; Neumeyer, Leonardo ; Kim, Yoon ; Ronen, Orith

  • Author_Institution
    Speech Technol. & Res. Lab., SRI Int., USA
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1471
  • Abstract
    This work is part of an effort aimed at developing computer-based systems for language instruction; we address the task of grading the pronunciation quality of the speech of a student of a foreign language. The automatic grading system uses SRI´s DecipherTM continuous speech recognition system to generate phonetic segmentations. Based on these segmentations and probabilistic models we produce pronunciation scores for individual or groups of sentences. Scores obtained from expert human listeners are used as the reference to evaluate the different machine scores and to provide targets when training some of the algorithms. In previous work we had found that duration-based scores outperformed HMM log-likelihood-based scores. In this paper we show that we can significantly improve HMM-based scores by using average phone segment posterior probabilities. Correlation between machine and human scores went up from r=0.50 with likelihood-based scores to r=0.88 with posterior-based scores. The new measures also outperformed duration-based scores in their ability to produce reliable scores from only a few sentences
  • Keywords
    computer aided instruction; correlation methods; hidden Markov models; probability; speech intelligibility; speech processing; speech recognition; HMM log-likelihood based scores; SRI Decipher; algorithms training; automatic grading system; automatic pronunciation scoring; average phone segment posterior probabilities; computer based systems; continuous speech recognition system; correlation; duration based scores; expert human listeners; foreign language student; language instruction; machine scores; phonetic segmentations; probabilistic models; pronunciation quality; pronunciation scores; sentences; speech quality; Algorithm design and analysis; Computer aided instruction; Databases; Feedback; Hidden Markov models; Humans; Laboratories; Natural languages; Production systems; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596227
  • Filename
    596227