• DocumentCode
    591778
  • Title

    Analysis on mispronunciations in CAPT based on computational speech perception

  • Author

    Jia Jia ; Wai-Kim Leung ; Ye Tian ; Lianhong Cai ; Meng, H.M.

  • Author_Institution
    Key Lab. of Pervasive Comput., Chinese Univ. of Hong Kong, Hong Kong, China
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    174
  • Lastpage
    178
  • Abstract
    Computer-aided Pronunciation Training (CAPT) technologies enable the use of automatic speech recognition to detect mispronunciations in second language (L2) learners´ speech. In order to further facilitate learning, we aim to be able to develop a principle-based method for generating a gradation of the severity of mispronunciations. This paper presents an approach towards gradation that is motivated by auditory perception. We have developed a computational method for generating a perceptual distance (PD) between two spoken phonemes. This is used to compute the distance between two phonemes of a target (L2) language. The PD is found to correlate well with the mispronunciations detected in CAPT system for Chinese learners of English, i.e. L1 being Chinese (Cantonese) and L2 being US English. These results indicate that auditory confusion indirectly reflects pronunciation confusions in L2 learning. The PD can also be used to help us grade the severity of errors (i.e. mispronunciations that confuse more distant phonemes are more severe) and accordingly prioritize the order of corrective feedback generated for the learners.
  • Keywords
    computer based training; natural language processing; speech recognition; CAPT technologies; Cantonese; Chinese learners; L2 learning; US English; auditory perception; automatic speech recognition; computational method; computational speech perception; computer-aided pronunciation training technologies; corrective feedback generation; mispronunciation analysis; mispronunciation detection; mispronunciation severity gradation generation; perceptual distance generation; principle-based method; spoken phonemes; Correlation; Educational institutions; Feature extraction; Laboratories; Speech; Speech recognition; Training; computational speech perception; computer-aided pronunciation training; mispronunciation; second language learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423530
  • Filename
    6423530