DocumentCode
417278
Title
Integrating thumbnail features for speech recognition using conditional exponential models
Author
Yu, Hua ; Waibel, Alex
Author_Institution
Interactive Syst. Labs, Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
We describe a novel approach for modeling segmental information in speech recognition, through the use of thumbnail features. By taking into account dependencies at the segmental level, thumbnail features are more resistant to changes in speaking rates and other factors. While the traditional acoustic features are fixed for every utterance, one set of thumbnail features is computed for each hypothesis, which may violate the traditional scoring paradigm. To this end, we introduce a conditional exponential modeling framework. It allows better integration of various knowledge sources in a discriminative fashion. We present preliminary experiments on the Switchboard task.
Keywords
feature extraction; hidden Markov models; learning (artificial intelligence); speech recognition; HMM; Switchboard task; conditional exponential models; segmental information; speech recognition; thumbnail features; Handwriting recognition; Hidden Markov models; Image segmentation; Interactive systems; Machine learning; Pattern matching; Proposals; Robustness; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326130
Filename
1326130
Link To Document