Title :
Speaker adaptation based on speaker-dependent eigenphone estimation
Author :
Zhang, Wen-Lin ; Zhang, Wei-Qiang ; Li, Bi-Cheng
Author_Institution :
Dept. of Inf. Sci., Zhengzhou Inf. Sci. & Technol. Inst., Zhengzhou, China
Abstract :
Based on speaker dependent eigenphone estimation, a novel speaker adaptation technique is proposed in this paper. Different from conventional speaker adaptation approaches, the proposed method explicitly models the phone variations for each speaker through subspace modeling in the phone space. The phone coordinate, which is shared by all speakers, contains correlation information between different phones. During speaker adaptation, two schemes for estimation of the new speaker specific phone variation bases (namely eigenphones) are derived under maximum likelihood (ML) criterion and maximum a posteriori (MAP) criterion respectively. Supervised speaker adaptation experiments on a Mandarin Chinese continuous speech recognition task show that the new method outperforms both eigenvoice and maximum likelihood linear regression (MLLR) methods when sufficient adaptation data is available.
Keywords :
eigenvalues and eigenfunctions; maximum likelihood estimation; natural languages; regression analysis; speech recognition; Mandarin Chinese continuous speech recognition; eigenvoice; maximum a posteriori criterion; maximum likelihood criterion; maximum likelihood linear regression; phone coordinate; phone space; phone variations; speaker-dependent eigenphone estimation; subspace modeling; supervised speaker adaptation; Adaptation models; Correlation; Hidden Markov models; Maximum likelihood estimation; Training; Vectors;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
DOI :
10.1109/ASRU.2011.6163904