Title :
Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition
Author :
Botterweck, Henrik
Author_Institution :
Philips GmbH Forschungslab., Aachen, Germany
Abstract :
A general method is examined, which unifies the eigenvoice approach and MAP adaptation. The a priori distribution for MAP is chosen to be anisotropic with the eigenvoices as preferred directions while still allowing adaptation into all other directions. This allows the exploitation of a priori knowledge about typical speaker variability within the MAP framework. This approach has two advantages: long term adaptation leads to the same good results as the MAP method, whereas for ultra-short adaptation in the range of 1-2 seconds an overfitting as for maximum likelihood techniques is avoided. The method is applied to large vocabulary continuous speech recognition. Results are to be compared with our transfer of the maximum likelihood eigenvoice method to LVCSR (Botterweck 2000). Even after only one recognized word significant improvements of the WER of up to 6% relative are observed for gender independent recognition. 14% improvements are obtained after 5 seconds
Keywords :
eigenvalues and eigenfunctions; maximum likelihood estimation; speech recognition; WER; a priori distribution; anisotropic MAP; eigenvoices; gender independent recognition; large vocabulary continuous speech recognition; long term adaptation; speaker variability; ultra-short adaptation; word error rate; Anisotropic magnetoresistance; Hidden Markov models; Maximum likelihood linear regression; Probability distribution; Robustness; Speech recognition; Vectors; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940840