Title :
Fusion of visual and acoustic signals for command-word recognition
Author :
Kober, Rudolf ; Harz, U. ; Schiffers, Jutta
Author_Institution :
Res. Inst. for Appl. Knowledge Process., Ulm, Germany
Abstract :
We investigate the question of how the visual information of lip movement contributes to command-word recognition. The fusion of the acoustic and visual signal can be carried out either at the feature level or at the class level. Integration at the feature level means merging of the acoustic and visual features to yield a combined feature vector which is fed into a HMM-system. Fusion at the class level means separate classification of the two sources of information and combination of the classification results. An HMM classifier is used for the acoustic signal and three different classifiers (HMM, DTW and ClaRe) for the visual signal. The classification results are combined using the C4.5 decision tree classifier. The recognition rates of both fusion schemes are comparable. Both yield small improvements at high SNRs using the acoustic/visual system in comparison to the acoustic system alone. Larger improvements (up to 12%) result at low SNRs
Keywords :
acoustic signal processing; feature extraction; hidden Markov models; image processing; sensor fusion; speech processing; speech recognition; C4.5 decision tree classifier; ClaRe classifier; DTW; HMM classifier; HMM system; acoustic features; acoustic signals; acoustic/visual system; class level; classification results; combined feature vector; command word recognition; feature level; high SNR; lip movement; low SNR; recognition rates; signals fusion; source classification; visual features; visual information; visual signals; Feeds; Fuses; Hidden Markov models; Merging; Mouth; Neural networks; Signal to noise ratio; Speech recognition; Testing; Visual system;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596233