Title :
Confidence-driven iterative speaker adaptation in transcription-mode speech recognition
Author :
Ou, Jiazhi ; Chen, Kaijiang ; Li, Zongge
Author_Institution :
Dept. of Comput. Sci., Fudan Univ., Shanghai, China
Abstract :
Speaker adaptation in transcription-mode speech recognition means adaptation on the test set itself rather than on an independent data set. A confidence-driven iterative adaptation approach is presented. Speech recognition and speaker adaptation are performed alternately. A combination of MLLR (maximum likelihood linear regression) and MAP (maximum a posteriori) approaches is used for adaptation. To avoid erroneous transcription in adaptation, an N-best based confidence measure is introduced. The procedure includes a coarse classification and a delicate measurement. Experiments were carried out on a large vocabulary Mandarin continuous speech recognition. A baseline model for comparison was built. The experimental results showed that our proposed approach reduced the word error rate by 51.8% compared to the initial speaker-independent system and 39.3% compared to the baseline model
Keywords :
hidden Markov models; iterative methods; maximum likelihood estimation; natural languages; pattern classification; speech recognition; HMM; MAP approach; Mandarin; N-best based confidence measure; coarse classification; confidence-driven speaker adaptation; hidden Markov model; iterative speaker adaptation; large vocabulary continuous speech recognition; maximum a posteriori approach; maximum likelihood linear regression; transcription-mode speech recognition; Broadcasting; Computer science; Error analysis; Hidden Markov models; Iterative methods; Maximum likelihood linear regression; Speech processing; Speech recognition; Testing; Vocabulary;
Conference_Titel :
Info-tech and Info-net, 2001. Proceedings. ICII 2001 - Beijing. 2001 International Conferences on
Conference_Location :
Beijing
Print_ISBN :
0-7803-7010-4
DOI :
10.1109/ICII.2001.983656