DocumentCode :
1695101
Title :
Toward unsupervised discovery of pronunciation error patterns using universal phoneme posteriorgram for computer-assisted language learning
Author :
Yow-Bang Wang ; Lin-Shan Lee
Author_Institution :
Grad. Inst. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2013
Firstpage :
8232
Lastpage :
8236
Abstract :
In Computer-Aided Pronunciation Training, we hope to specify the type of mispronunciation, or Error Pattern (EP), the language learner has made as a more effective feedback. But derivation of EPs usually requires expert knowledge and pedagogical experiences, which is not easy to obtain for each pair of target and native languages. In this paper we propose a preliminary framework toward unsupervised discovery of EPs from a corpus of learners´ recordings. We use Universal Phoneme Posteriorgram, derived from Multi-Layer Perceptron trained with a corpus of mixed languages, as features to bring supervised knowledge into the unsupervised task. We also use Hierarchical Agglomerative Clustering algorithm to explore sub-segmental variation of phoneme segments for distinguishing EPs. We tested K-means (assuming known number of EPs) and Gaussian Mixture Model with minimum description length principle (estimating unknown number of EPs) for EP discovery. Preliminary experimental results illustrated the effectiveness of the proposed framework, although there is still a long way to go compared to human annotators.
Keywords :
computer aided instruction; multilayer perceptrons; natural language processing; Gaussian mixture model; computer aided pronunciation training; computer assisted language learning; hierarchical agglomerative clustering algorithm; human annotators; language learner; mispronunciation; multilayer perceptron; native languages; pedagogical experiences; phoneme segments; pronunciation error patterns; supervised knowledge; universal phoneme posteriorgram; unsupervised discovery; unsupervised task; Acoustics; Clustering algorithms; Feature extraction; Indexes; Speech; Training; Vectors; GMM-MDL; HAC; K-means; Pronunciation Error Pattern Discovery; Rand Index; Universal Phoneme Posteriorgram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639270
Filename :
6639270
Link To Document :
بازگشت