DocumentCode :
2178230
Title :
Whole word discriminative point process models
Author :
Jansen, Aren
Author_Institution :
HLT Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
5180
Lastpage :
5183
Abstract :
This paper introduces a discriminative extension to whole-word point process modeling techniques. Meant to circumvent the strong independence assumptions of their generative predecessors, discriminative point process models (DPPMs) are trained to distinguish the composite temporal patterns of phonetic events produced for a given word from those of its impostors. Using correct and incorrect word hypotheses extracted from large vocabulary recognizer lattices, we train whole-word DPPMs to provide an alternative set of acoustic model scores. Using solely the timing of sparse phonetic events, DPPM scores exhibit discriminative power comparable to that of scores produced by a state-of-the-art acoustic model built using the IBM Attila Speech Recognition Toolkit. In addition, the inherent complementarity of frame-based and event-based models permits significant improvements from score combination.
Keywords :
speech recognition; DPPM; IBM Attila speech recognition toolkit; incorrect word hypotheses; vocabulary recognizer lattices; whole word discriminative point process models; Acoustics; Computational modeling; Kernel; Lattices; Speech; Training; discriminative training; point process model
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947524
Filename :
5947524