DocumentCode
1749670
Title
Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition
Author
Botterweck, Henrik
Author_Institution
Philips GmbH Forschungslab., Aachen, Germany
Volume
1
fYear
2001
fDate
2001
Firstpage
353
Abstract
A general method is examined, which unifies the eigenvoice approach and MAP adaptation. The a priori distribution for MAP is chosen to be anisotropic with the eigenvoices as preferred directions while still allowing adaptation into all other directions. This allows the exploitation of a priori knowledge about typical speaker variability within the MAP framework. This approach has two advantages: long term adaptation leads to the same good results as the MAP method, whereas for ultra-short adaptation in the range of 1-2 seconds an overfitting as for maximum likelihood techniques is avoided. The method is applied to large vocabulary continuous speech recognition. Results are to be compared with our transfer of the maximum likelihood eigenvoice method to LVCSR (Botterweck 2000). Even after only one recognized word significant improvements of the WER of up to 6% relative are observed for gender independent recognition. 14% improvements are obtained after 5 seconds
Keywords
eigenvalues and eigenfunctions; maximum likelihood estimation; speech recognition; WER; a priori distribution; anisotropic MAP; eigenvoices; gender independent recognition; large vocabulary continuous speech recognition; long term adaptation; speaker variability; ultra-short adaptation; word error rate; Anisotropic magnetoresistance; Hidden Markov models; Maximum likelihood linear regression; Probability distribution; Robustness; Speech recognition; Vectors; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940840
Filename
940840
Link To Document