DocumentCode
2177428
Title
Integrating meta-information into exemplar-based speech recognition with segmental conditional random fields
Author
Demuynck, Kris ; Seppi, Dino ; Van Compernolle, Dirk ; Nguyen, Patrick ; Zweig, Geoffrey
Author_Institution
ESAT, Katholieke Univ. Leuven, Leuven, Belgium
fYear
2011
fDate
22-27 May 2011
Firstpage
5048
Lastpage
5051
Abstract
Exemplar based recognition systems are characterized by the fact that, instead of abstracting large amounts of data into compact models, they store the observed data enriched with some annotations and infer on-the-fly from the data by finding those exemplars that resemble the input speech best. One advantage of exemplar based systems is that next to deriving what the current phone or word is, one can easily derive a wealth of meta-information concerning the chunk of audio under investigation. In this work we harvest meta-information from the set of best matching exemplars, that is thought to be relevant for the recognition such as word boundary predictions and speaker entropy. Integrating this meta-information into the recognition framework using segmental conditional random fields, reduced the WER of the exemplar based system on the WSJ Nov92 20k task from 8.2% to 7.6%. Adding the HMM-score and multiple HMM phone detectors as features further reduced the error rate to 6.6%.
Keywords
hidden Markov models; speech processing; speech recognition; HMM-score phone detector; WER; WSJ task; boundary prediction; exemplar-based speech recognition; metainformation integration; multiple HMM phone detector; segmental conditional random field; speaker entropy; Acoustics; Databases; Detectors; Hidden Markov models; Speech; Speech recognition; Training; Conditional Random Fields; Example Based Recognition; SCARF; Speech Recognition; Template Based Recognition; k Nearest Neighbours;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947491
Filename
5947491
Link To Document