DocumentCode :
3423509
Title :
Quick fmllr for speaker adaptation in speech recognition
Author :
Varadarajan, Balakrishnan ; Povey, Daniel ; Chu, Stephen M.
Author_Institution :
Johns Hopkins Univ., Baltimore, MD
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4297
Lastpage :
4300
Abstract :
Feature space maximum likelihood linear regression (fMLLR) is a widely used technique for speaker adaptation in HMM-based speech recognition. However, in extremely resource constrained systems the time required to perform the sufficient statistics accumulation for fMLLR adaptation can be considerable. In this paper we describe a novel method that can lead to significant reduction in the time taken for statistics accumulation while preserving the adaptation gains. The proposed quick fMLLR (Q-fMLLR) algorithm is implemented in a state-of-the-art large-vocabulary continuous speech recognition system, and evaluated on a broadcast transcription task. We present results both in terms of the average likelihood after adaptation and the character error rate. It is shown that Q-fMLLR attains the performance of regular fMLLR with a fraction of the computation.
Keywords :
hidden Markov models; maximum likelihood estimation; regression analysis; speech recognition; HMM; character error rate; feature space maximum likelihood linear regression; speaker adaptation; speech recognition; Approximation algorithms; Broadcasting; Convergence; Covariance matrix; Error analysis; Gaussian processes; Maximum likelihood linear regression; Speech recognition; Statistics; Time factors; MLLR; constrained MLLR; fMLLR; feature space MLLR; speaker adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518605
Filename :
4518605
Link To Document :
بازگشت