DocumentCode
3423363
Title
An empirical study of automatic accent classification
Author
Choueiter, Ghinwa ; Zweig, Geoffrey ; Nguyen, Patrick
Author_Institution
Massachusetts Inst. of Technol., Cambridge, MA
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4265
Lastpage
4268
Abstract
This paper extends language identification (LID) techniques to a large scale accent classification task: 23-way classification of foreign-accented English. We find that a purely acoustic approach based on a combination of heteroscedastic linear discriminant analysis (HLDA) and maximum mutual information (MMI) training is very effective. In contrast to LID tasks, methods based on parallel language models prove much less effective. We focus on the Oregon Graduate Institute Foreign-Accented English dataset, and obtain a detection rate of 32%, which to our knowledge is the best reported result for 23-way accent classification.
Keywords
acoustic signal processing; natural languages; signal classification; speech recognition; statistical analysis; acoustic approach; automatic accent classification; foreign-accented English; heteroscedastic linear discriminant analysis; language identification; maximum mutual information training; parallel language model; Acoustic signal detection; Advertising; Demography; Hidden Markov models; Large-scale systems; Linear discriminant analysis; Mutual information; Natural languages; Parallel languages; Statistics; Accent classifier; GMM; Gaussian tokenization; MMI; language identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518597
Filename
4518597
Link To Document