مرکز منطقه ای اطلاع رساني علوم و فناوري - Gaussian Mixture Modeling of vowel durations for automated assessment of non-native speech

DocumentCode :

2181558

Title :

Gaussian Mixture Modeling of vowel durations for automated assessment of non-native speech

Author :

Sun, Xie ; Evanini, Keelan

Author_Institution :

Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

5716

Lastpage :

5719

Abstract :

This paper investigates using Gaussian Mixture Model (GMM) based vowel duration features for automated assessment of non-native speech. Two different types of models were compared: a single GMM trained on a reference corpus of native speech and separate GMMs for different proficiency levels trained on a large corpus of scored non-native speech. 13 vowel categories were evaluated separately (after normalization by rate of speech), and a multiple regression model was used to evaluate the performance of all vowel categories combined. Experiments on an English language proficiency assessment show that the non-native speech GMMs outperform the native speech GMMs, and that all 13 vowels have significant correlations with human scores when the non-native speech GMMs are used. The multiple regression combination obtained correlations with human scores of 0.71 when transcriptions were used to extract the vowel durations and 0.64 when the Automatic Speech Recognition (ASR) output was used. The experiments demonstrate that the vowel duration feature based on non-native speech GMMs is a useful predictor of L2 proficiency and is robust to different datasets and situations.

Keywords :

Gaussian processes; speech recognition; ASR; Gaussian mixture modeling; automatic speech recognition; multiple regression model; nonnative speech GMM; nonnative speech automated assessment; performance evaluation; vowel durations; Correlation; Erbium; Humans; Speech; Speech recognition; Training; Training data; Computer Assisted Language Learning; Gaussian Mixture Model; automated pronunciation assessment; vowel duration;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947658

Filename :

5947658

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2181558