An empirical study of automatic accent classification

Author

Choueiter, Ghinwa ; Zweig, Geoffrey ; Nguyen, Patrick

Author_Institution

Massachusetts Inst. of Technol., Cambridge, MA

fYear

2008

fDate

March 31 2008-April 4 2008

Firstpage

4265

Lastpage

4268

Abstract

This paper extends language identification (LID) techniques to a large scale accent classification task: 23-way classification of foreign-accented English. We find that a purely acoustic approach based on a combination of heteroscedastic linear discriminant analysis (HLDA) and maximum mutual information (MMI) training is very effective. In contrast to LID tasks, methods based on parallel language models prove much less effective. We focus on the Oregon Graduate Institute Foreign-Accented English dataset, and obtain a detection rate of 32%, which to our knowledge is the best reported result for 23-way accent classification.

Keywords

acoustic signal processing; natural languages; signal classification; speech recognition; statistical analysis; acoustic approach; automatic accent classification; foreign-accented English; heteroscedastic linear discriminant analysis; language identification; maximum mutual information training; parallel language model; Acoustic signal detection; Advertising; Demography; Hidden Markov models; Large-scale systems; Linear discriminant analysis; Mutual information; Natural languages; Parallel languages; Statistics; Accent classifier; GMM; Gaussian tokenization; MMI; language identification;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location

Las Vegas, NV

ISSN

1520-6149

Print_ISBN

978-1-4244-1483-3

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2008.4518597

Filename

4518597