Title :
Multimodal fusion by adaptive compensation for feature uncertainty with application to audiovisual speech recognition
Author :
Katsamanis, Athanassios ; Papandreou, George ; Pitsikalis, Vassilis ; Maragos, Petros
Author_Institution :
Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens, Athens, Greece
Abstract :
In pattern recognition one usually relies on measuring a set of informative features to perform tasks such as classification. While the accuracy of feature measurements heavily depends on changing environmental conditions, studying the consequences of this fact has received relatively little attention to date. In this work we explicitly take into account uncertainty in feature measurements and we show in a rigorous probabilistic framework how the models used for classification should be adjusted to compensate for this effect. Our approach proves to be particularly fruitful in multimodal fusion scenarios, such as audio-visual speech recognition, where multiple streams of complementary features are integrated. For such applications, provided that an estimate of the measurement noise uncertainty for each feature stream is available, we show that the proposed framework leads to highly adaptive multimodal fusion rules which are widely applicable and easy to implement. We further show that previous multimodal fusion methods relying on stream weights fall under our scheme if certain assumptions hold; this provides novel insights into their applicability for various tasks and suggests new practical ways for estimating the stream weights adaptively. Preliminary experimental results in audio-visual speech recognition demonstrate the potential of our approach.
Keywords :
speech recognition; adaptive compensation; audio-visual speech recognition; feature uncertainty; measurement noise; multimodal fusion; pattern recognition; Face; Noise; Shape; Speech; Speech recognition; Uncertainty; Visualization;
Conference_Titel :
Signal Processing Conference, 2006 14th European
Conference_Location :
Florence