  • DocumentCode
    730837
  • Title
    Automatic assessment of English learner pronunciation using discriminative classifiers
  • Author
    Nicolao, Mauro; Beeston, Amy V.; Hain, Thomas
  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sheffield, Sheffield, UK
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5351
  • Lastpage
    5355
  • Abstract
    This paper presents a novel system for automatic assessment of the pronunciation quality of English learner speech, based on deep neural network (DNN) features and phoneme-specific discriminative classifiers. DNNs trained on a large corpus of native and non-native learner speech are used to extract phoneme posterior probabilities. Part of the corpus includes per-phone teacher annotations, which allows training of two Gaussian mixture models (GMMs), representing correct pronunciations and typical error patterns. The likelihood ratio is then obtained for each observed phone. Several models were evaluated on a large corpus of English-learning students, with a variety of skill levels, and aged 13 upwards. The cross-correlation between the best system and average human annotator reference scores is 0.72, with miss and false alarm rates of around 19%. Automatic assessment is 81.6% correct with a high degree of confidence. The new approach significantly outperforms spectral-distance-based baseline systems.
  • Keywords
    Gaussian processes; mixture models; neural nets; speech recognition; DNN features; GMM; Gaussian mixture models; automatic assessment; deep neural network; English learner pronunciation; English learner speech; English-learning students; human annotator reference scores; likelihood ratio; nonnative learner speech; per phone teacher annotations; phoneme posterior probability; phoneme specific discriminative classifiers; pronunciation quality; Acoustics; Feature extraction; Neural networks; Nickel; Regression tree analysis; Speech; Training; Computer-Assisted Language Learning; DNN-GMM; Pronunciation assessment; binary classifier
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type
    conf
  • DOI
    10.1109/ICASSP.2015.7178993
  • Filename
    7178993
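
The abstract describes scoring each observed phone with a likelihood ratio between two GMMs, one modelling correct pronunciations and one modelling typical errors. The following is a minimal sketch of that idea only, not the authors' implementation. Several things here are assumptions for illustration: the diagonal covariances, the averaging of per-frame scores over a phone segment, the zero decision threshold, the toy two-dimensional "posterior" vectors, and every name (`phone_llr`, `is_correct`, `CORRECT_GMM`, `ERROR_GMM`). In the paper the GMMs are trained from teacher annotations; here their parameters are simply hard-coded.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of x under a GMM given as (weight, mean, var) tuples."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def phone_llr(frames, correct_gmm, error_gmm):
    """Average per-frame log-likelihood ratio over one phone segment.

    Averaging over frames is an assumption; the abstract only states that
    a likelihood ratio is obtained for each observed phone.
    """
    total = sum(
        gmm_log_likelihood(f, correct_gmm) - gmm_log_likelihood(f, error_gmm)
        for f in frames
    )
    return total / len(frames)


def is_correct(frames, correct_gmm, error_gmm, threshold=0.0):
    """Binary decision: True if the phone looks correctly pronounced."""
    return phone_llr(frames, correct_gmm, error_gmm) > threshold


# Hypothetical, hand-picked parameters (not trained, not from the paper).
CORRECT_GMM = [
    (0.6, [0.9, 0.1], [0.01, 0.01]),
    (0.4, [0.7, 0.3], [0.02, 0.02]),
]
ERROR_GMM = [(1.0, [0.3, 0.7], [0.05, 0.05])]

# Toy feature frames standing in for DNN phoneme posteriors.
good_phone = [[0.88, 0.12], [0.91, 0.09]]
bad_phone = [[0.32, 0.68], [0.28, 0.72]]

print(is_correct(good_phone, CORRECT_GMM, ERROR_GMM))
print(is_correct(bad_phone, CORRECT_GMM, ERROR_GMM))
```

Working in the log domain keeps the ratio numerically stable when frame likelihoods are very small. The threshold is the knob that would trade misses against false alarms, the two error rates the abstract reports.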