Title :
Machine learning approaches to improving pronunciation error detection on an imbalanced corpus
Author :
Xuesong Yang ; Loukina, Anastassia ; Evanini, Keelan
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
In this paper, we investigate the task of phone-level pronunciation error detection as a binary classification problem, the performance of which is heavily affected by the imbalanced distribution of the classes in a manually annotated data set of non-native English. In order to address problems caused by this extreme class imbalance, methods for cost-sensitive learning (weighting inversely proportional to class frequencies) and over-sampling of synthetic instances (SMOTE) are investigated in order to improve classification performance. Experiments using classifiers consisting of features based on acoustic phonetics and word identity demonstrate that these machine learning approaches lead to performance improvements over the baseline system based on the extremely imbalanced data. In addition, several different types of classifiers were compared. Finally, the paper analyzes the robustness of classifier performance across different phones.
Keywords :
computer based training; learning (artificial intelligence); pattern classification; sampling methods; speech processing; SMOTE; acoustic phonetics; binary classification problem; cost-sensitive learning; imbalanced corpus; machine learning approaches; nonnative English; phone-level pronunciation error detection; pronunciation error detection; synthetic instances over-sampling; word identity; Abstracts; Acoustics; Pragmatics; Speech; Imbalanced Learning; Pronunciation Error Detection; Sampling Methods; Spoken Language Assessment;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078591