• DocumentCode
    2612140
  • Title

    Adaptive frequency cepstral coefficients for word mispronunciation detection

  • Author

    Ge, Zhenhao ; Sharma, Sudhendu R. ; Smith, Mark J T

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
  • Volume
    5
  • fYear
    2011
  • fDate
    15-17 Oct. 2011
  • Firstpage
    2388
  • Lastpage
    2391
  • Abstract
    Systems based on automatic speech recognition (ASR) technology can provide important functionality in computer assisted language learning applications. This is a young but growing area of research motivated by the large number of students studying foreign languages. Here we propose a Hidden Markov Model (HMM)-based method to detect mispronunciations. Exploiting the specific dialog scripting employed in language learning software, HMMs are trained for different pronunciations. New adaptive features have been developed and obtained through an adaptive warping of the frequency scale prior to computing the cepstral coefficients. The optimization criterion used for the warping function is to maximize separation of two major groups of pronunciations (native and non-native) in terms of classification rate. Experimental results show that the adaptive frequency scale yields a better coefficient representation leading to higher classification rates in comparison with conventional HMMs using Mel-frequency cepstral coefficients.
  • Keywords
    cepstral analysis; hidden Markov models; natural language processing; optimisation; speech recognition; Mel-frequency cepstral coefficients; adaptive frequency cepstral coefficients; automatic speech recognition technology; computer assisted language learning applications; dialog scripting; foreign languages; frequency scale adaptive warping; hidden Markov model based method; optimization criterion; word mispronunciation detection; Hidden Markov models; Humans; Interpolation; Mel frequency cepstral coefficient; Optimization; Training; AFCC; ASR; Frequency scale; MFCC; Mispronunciation detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing (CISP), 2011 4th International Congress on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-9304-3
  • Type

    conf

  • DOI
    10.1109/CISP.2011.6100685
  • Filename
    6100685