Title :
Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop
Author :
Livescu, Karen ; Cetin, Omer ; Hasegawa-Johnson, Mark ; King, Simon ; Bartels, Christopher ; Borges, N. ; Kantor, Amir ; Lal, Pyare ; Yung, L. ; Bezman, A. ; Dawson-Haggerty, Stephen ; Woods, B. ; Frankel, Jorg ; Magami-Doss, M. ; Saenko, Kate
Author_Institution :
Massachusetts Inst. of Technol., MA, USA
Abstract :
We report on investigations, conducted at the 2006 Johns Hopkins Workshop, into the use of articulatory features (AFs) for observation and pronunciation models in speech recognition. In the area of observation modeling, we use the outputs of AF classifiers both directly, in an extension of hybrid HMM/neural network models, and as part of the observation vector, an extension of the "tandem" approach. In the area of pronunciation modeling, we investigate a model having multiple streams of AF states with soft synchrony constraints, for both audio-only and audio-visual recognition. The models are implemented as dynamic Bayesian networks, and tested on tasks from the small-vocabulary switchboard (SVitchboard) corpus and the CUAVE audio-visual digits corpus. Finally, we analyze AF classification and forced alignment using a newly collected set of feature-level manual transcriptions.
Keywords :
Bayes methods; feature extraction; hidden Markov models; neural nets; speech processing; speech recognition; 2006 JHU Summer Workshop; CUAVE audio-visual digits corpus; HMM; acoustic; articulatory feature-based methods; audio-visual recognition; audio-visual speech recognition; dynamic Bayesian networks; feature-level manual transcriptions; neural network models; pronunciation models; small-vocabulary switchboard; soft synchrony constraints; Speech recognition; speech processing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366989