Title :
Front-end feature transforms with context filtering for speaker adaptation
Author :
Huang, Jing ; Visweswariah, Karthik ; Olsen, Peder ; Goel, Vaibhava
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Feature-space transforms such as feature-space maximum likelihood linear regression (FMLLR) are very effective speaker adaptation technique, especially on mismatched test data. In this study, we extend the full-rank square matrix of FMLLR to a non-square matrix that uses neighboring feature vectors in estimating the adapted central feature vector. Through optimizing an appropriate objective function we aim to filter out and transform features through the correlation of the feature context. We compare to FMLLR that just con sider the current feature vector only. Our experiments are conducted on the automobile data with different speed conditions. Results show that context filtering improves 23% on word error rate over conventional FMLLR on noisy 60mph data with adapted ML model, and 7%/9% improvement over the discriminatively trained FMMI/BMMI models.
Keywords :
filtering theory; matrix algebra; regression analysis; speaker recognition; transforms; vectors; FMLLR; FMMI-BMMI model; adapted central feature vector estimation; context filtering; feature-space maximum likelihood linear regression; front-end feature-space transform; full-rank square matrix; mismatched test data; non-square matrix; objective function optimization; speaker adaptation technique; velocity 60 mph; Adaptation models; Context; Context modeling; Data models; Hidden Markov models; Noise measurement; Transforms; Feature-space transforms; context filtering; feature-space maximum likelihood linear regression;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947339