DocumentCode :
835932
Title :
Advances in phone-based modeling for automatic accent classification
Author :
Angkititrakul, Pongtep ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst., Univ. of Texas, Richardson, TX, USA
Volume :
14
Issue :
2
fYear :
2006
fDate :
3/1/2006 12:00:00 AM
Firstpage :
634
Lastpage :
646
Abstract :
It is suggested that algorithms capable of estimating and characterizing accent knowledge would provide valuable information in the development of more effective speech systems such as speech recognition, speaker identification, audio stream tagging in spoken document retrieval, channel monitoring, or voice conversion. Accent knowledge could be used for selection of alternative pronunciations in a lexicon, engage adaptation for acoustic modeling, or provide information for biasing a language model in large vocabulary speech recognition. In this paper, we propose a text-independent automatic accent classification system using phone-based models. Algorithm formulation begins with a series of experiments focused on capturing the spectral evolution information as potential accent sensitive cues. Alternative subspace representations using principal component analysis and linear discriminant analysis with projected trajectories are considered. Finally, an experimental study is performed to compare the spectral trajectory model framework to a traditional hidden Markov model recognition framework using an accent sensitive word corpus. System evaluation is performed using a corpus representing five English speaker groups with native American English, and English spoken with Mandarin Chinese, French, Thai, and Turkish accents for both male and female speakers.
Keywords :
natural languages; principal component analysis; speech processing; speech recognition; automatic accent classification; linear discriminant analysis; phone-based modeling; principal component analysis; spectral evolution information; text-independent accent classification; Adaptation model; Hidden Markov models; Information retrieval; Loudspeakers; Monitoring; Natural languages; Speech recognition; Streaming media; Tagging; Vocabulary; Automatic accent classification; dialect modeling; open accent classification; phoneme recognition; spectral trajectory modeling; speech recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.851980
Filename :
1597266
Link To Document :
بازگشت