DocumentCode :
542263
Title :
Speaker selection training for large vocabulary continuous speech recognition
Author :
Huang, Chao ; Chen, Tao ; Chang, Eric
Author_Institution :
Microsoft Research Asia, 5F, Sigma Center, No. 49, Zhichun Road, Beijing 100080, China
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
Acoustic variability across speakers is one of the challenges of speaker independent (SI) speech recognition systems. As a powerful solution, dominant speaker adaptation technologies such as MLLR and MAP may become inefficient because of the lack of enough enrollment data. In this paper, we propose an adaptation method based on speaker selection training, which makes full use of statistics of training corpus. Relative error rate reduction of 5.31 % is achieved when only one utterance is available. We compare different speaker selection strategies, namely. PCA, HMM and GMM based methods. In addition, impacts of number of selected cohort speakers and number of utterances from target speaker are investigated. Furthermore, comparison and integration with MLLR adaptation are also shown. Finally, some ongoing work such as dynamicalJy varying number of selected speakers, measuring the relative contribution among the selected speakers and speeding up the computationally expensive procedure of re-estimation with model synthesis are also discussed.
Keywords :
Adaptation model; Chaos; Data models; Hidden Markov models; Robustness; Speech recognition; Thyristors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743791
Filename :
5743791
Link To Document :
بازگشت