مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker selection training for large vocabulary continuous speech recognition

DocumentCode :

542263

Title :

Speaker selection training for large vocabulary continuous speech recognition

Author :

Huang, Chao ; Chen, Tao ; Chang, Eric

Author_Institution :

Microsoft Research Asia, 5F, Sigma Center, No. 49, Zhichun Road, Beijing 100080, China

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

Acoustic variability across speakers is one of the challenges of speaker independent (SI) speech recognition systems. As a powerful solution, dominant speaker adaptation technologies such as MLLR and MAP may become inefficient because of the lack of enough enrollment data. In this paper, we propose an adaptation method based on speaker selection training, which makes full use of statistics of training corpus. Relative error rate reduction of 5.31 % is achieved when only one utterance is available. We compare different speaker selection strategies, namely. PCA, HMM and GMM based methods. In addition, impacts of number of selected cohort speakers and number of utterances from target speaker are investigated. Furthermore, comparison and integration with MLLR adaptation are also shown. Finally, some ongoing work such as dynamicalJy varying number of selected speakers, measuring the relative contribution among the selected speakers and speeding up the computationally expensive procedure of re-estimation with model synthesis are also discussed.

Keywords :

Adaptation model; Chaos; Data models; Hidden Markov models; Robustness; Speech recognition; Thyristors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743791

Filename :

5743791

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542263