Title :
Whole word discriminative point process models
Author_Institution :
HLT Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
This paper introduces a discriminative extension to whole-word point process modeling techniques. Meant to circumvent the strong independence assumptions of their generative predecessors, discriminative point process models (DPPM) are trained to distinguish the composite temporal patterns of phonetic events produced for a given word from those of its impostors. Using correct and incorrect word hypotheses extracted from large vocabulary recognizer lattices, we train whole-word DPPMs to provide an alternative set of acoustic model scores. Using solely the timing of sparse phonetic events, DPPM scores exhibit comparable discriminative power to those produced by a state-of-the-art acoustic model built using the IBM Attila Speech Recognition Toolkit. In addition, the inherent complementarity of frame-based and event-based models permits significant improvements from score combination.
Keywords :
speech recognition; DPPM; IBM Attila speech recognition toolkit; incorrect word hypotheses; vocabulary recognizer lattices; whole word discriminative point process models; Acoustics; Computational modeling; Kernel; Lattices; Speech; Speech recognition; Training; discriminative training; point process model; speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947524