Title :
A novel approach to detecting non-native speakers and their native language
Author :
Omar, Mohamed Kamal ; Pelecanos, Jason
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Speech contains valuable information regarding the traits of speakers. This paper investigates two aspects of this information. The first is automatic detection of non-native speakers and their native language on relatively large data sets. We present several experiments which show how our system outperforms the best published results on both the Fisher database and the foreign-accented English (FAE) database for detecting non-native speakers and their native language respectively. Such performance is achieved by using an SVM-based classifier with ASR-based features integrated with a novel universal background model (UBM) obtained by clustering the Gaussian components of an ASR acoustic model. The second aspect of this work is to utilize the detected speaker characteristics within a speaker recognition system to improve its performance.
Keywords :
Gaussian distribution; acoustic signal processing; speaker recognition; support vector machines; Fisher database; Gaussian components; SVM-based classifier; acoustic model; automatic detection; automatic speech recognition; foreign-accented English database; native language; nonnative speaker detection; speaker recognition system; universal background model; Acoustic signal detection; Automatic speech recognition; Biometrics; Customer service; Detectors; Information security; Loudspeakers; Natural languages; Spatial databases; Speaker recognition; Accent detection; K-means clustering; non-native speaker detection; speaker verification;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495628