DocumentCode
2044285
Title
Unsupervised Stream Weight Computation in a Segmentaion Task: Application to Audio-Visual Speech Recognition
Author
Sanchez-Soto, E. ; Daoudi, Khalid ; Potamianos, Alexandros
Author_Institution
IRIT-CNRS, Toulouse, France
fYear
2007
fDate
24-27 Nov. 2007
Firstpage
800
Lastpage
803
Abstract
We propose an efficient algorithm for unsupervised stream weight estimation in a segmentation task. Our method uses only the information carried by the test signal and the trained models. The work is based on results presented previously for the classification problem where it is indicated that the optimal stream weights are inversely proportional to the single stream misclassification error. We approximate this error relation by the intra- and inter-class distance ratio over the measured class distributions. This approach is then generalized to the segmentation problem by computing the distances among all the concerned classes. The proposed unsupervised estimation algorithm is evaluated on a an audio-visual speech recognition task. The obtained performances are comparable to the supervised minimum error training approach, up to a certain SNR level.
Keywords
audio-visual systems; least mean squares methods; speech recognition; video signal processing; audio-visual speech recognition; inter-class distance ratio; intra-class distance ratio; segmentation problem; unsupervised estimation algorithm; unsupervised stream weight computation; Audio recording; Automatic speech recognition; Information resources; Lips; Signal processing; Signal processing algorithms; Speech processing; Speech recognition; Streaming media; Testing; Audio-Visual Speech Recognition; Fusion Methods; Stream Weight Estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
Conference_Location
Dubai
Print_ISBN
978-1-4244-1235-8
Electronic_ISBN
978-1-4244-1236-5
Type
conf
DOI
10.1109/ICSPC.2007.4728440
Filename
4728440
Link To Document