Unsupervised Stream Weight Computation in a Segmentaion Task: Application to Audio-Visual Speech Recognition

Author

Sanchez-Soto, E. ; Daoudi, Khalid ; Potamianos, Alexandros

Author_Institution

IRIT-CNRS, Toulouse, France

fYear

2007

fDate

24-27 Nov. 2007

Firstpage

800

Lastpage

803

Abstract

We propose an efficient algorithm for unsupervised stream weight estimation in a segmentation task. Our method uses only the information carried by the test signal and the trained models. The work is based on results presented previously for the classification problem where it is indicated that the optimal stream weights are inversely proportional to the single stream misclassification error. We approximate this error relation by the intra- and inter-class distance ratio over the measured class distributions. This approach is then generalized to the segmentation problem by computing the distances among all the concerned classes. The proposed unsupervised estimation algorithm is evaluated on a an audio-visual speech recognition task. The obtained performances are comparable to the supervised minimum error training approach, up to a certain SNR level.

Keywords

audio-visual systems; least mean squares methods; speech recognition; video signal processing; audio-visual speech recognition; inter-class distance ratio; intra-class distance ratio; segmentation problem; unsupervised estimation algorithm; unsupervised stream weight computation; Audio recording; Automatic speech recognition; Information resources; Lips; Signal processing; Signal processing algorithms; Speech processing; Speech recognition; Streaming media; Testing; Audio-Visual Speech Recognition; Fusion Methods; Stream Weight Estimation;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on

Conference_Location

Dubai

Print_ISBN

978-1-4244-1235-8

Electronic_ISBN

978-1-4244-1236-5

Type

conf

DOI

10.1109/ICSPC.2007.4728440

Filename

4728440