DocumentCode :
1474852
Title :
Online Learning and Acoustic Feature Adaptation in Large-Margin Hidden Markov Models
Author :
Cheng, Chih-Chieh ; Sha, Fei ; Saul, Lawrence K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of California, La Jolla, CA, USA
Volume :
4
Issue :
6
fYear :
2010
Firstpage :
926
Lastpage :
942
Abstract :
We explore the use of sequential, mistake-driven updates for online learning and acoustic feature adaptation in large-margin hidden Markov models (HMMs). The updates are applied to the parameters of acoustic models after the decoding of individual training utterances. For large-margin training, the updates attempt to separate the log-likelihoods of correct and incorrect transcriptions by an amount proportional to their Hamming distance. For acoustic feature adaptation, the updates attempt to improve recognition by linearly transforming the features computed by the front end. We evaluate acoustic models trained in this way on the TIMIT speech database. We find that online updates for large-margin training not only converge faster than analogous batch optimizations, but also yield lower phone error rates than approaches that do not attempt to enforce a large margin. Finally, experimenting with different schemes for initialization and parameter-tying, we find that acoustic feature adaptation leads to further improvements beyond the already significant gains achieved by large-margin training.
Keywords :
hidden Markov models; maximum likelihood estimation; speech recognition; Hamming distance; TIMIT speech database; acoustic feature adaptation; discriminative training; large-margin hidden Markov models; online learning; Automatic speech recognition; Computer science; Error analysis; Hidden Markov models; Machine learning algorithms; Management training; Maximum likelihood estimation; Parameter estimation; Permission; Training data; Acoustic feature adaptation; automatic speech recognition (ASR); discriminative training; hidden Markov models (HMMs); large-margin classification; online learning;
fLanguage :
English
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
Publisher :
ieee
ISSN :
1932-4553
Type :
jour
DOI :
10.1109/JSTSP.2010.2048607
Filename :
5451105
Link To Document :
بازگشت