DocumentCode
394301
Title
Comparison of acoustic model adaptation techniques on non-native speech
Author
Zhirong Wang ; Schultz, Tanja ; Waibel, Alex
Author_Institution
Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
The performance of speech recognition systems is consistently poor on non-native speech. The challenge for non-native speech recognition is to maximize the recognition performance with a small amount of available non-native data. We report on acoustic modeling adaptation for the recognition of non-native speech. Using non-native data from German speakers, we investigate how bilingual models, speaker adaptation, acoustic model interpolation and polyphone decision tree specialization methods can help to improve the recognizer performance. Results obtained from the experiments demonstrate the feasibility of these methods.
Keywords
acoustic signal processing; decision trees; interpolation; natural languages; optimisation; speaker recognition; German speakers; acoustic model adaptation techniques; acoustic model interpolation; bilingual models; nonnative speech recognition; polyphone decision tree specialization; speaker adaptation; Adaptation model; Databases; Decision trees; Hidden Markov models; Interpolation; Loudspeakers; Natural languages; Speech recognition; Testing; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198837
Filename
1198837
Link To Document