DocumentCode :
2723824
Title :
A Robust Speaking Face Modelling Approach Based on Multilevel Fusion
Author :
Chetty, Girija ; Wagner, Michael
fYear :
2007
fDate :
3-5 Dec. 2007
Firstpage :
408
Lastpage :
415
Abstract :
approach based on multilevel fusion of 3D face biometric information with audio and visual speech information for biometric identity verification applications. The proposed approach combines the information from three audio-video based modules, namely: audio, visual speech, and 3D face and performs tri-module fusion in an automatic, unsupervised and adaptive manner, by adapting to the local performance of each module. This is done by taking the output-score based reliability estimates (confidence measures) of each of the module into account. The module weightings are determined automatically such that the reliability measure of the combined scores is maximised. To test the robustness of the proposed approach, the audio and visual speech (mouth) modalities are degraded to emulate various levels of train/test mismatch; employing additive white Gaussian noise for the audio and JPEG compression for the video signals. The results show improved fusion performance for a range of tested levels of audio and video degradation, compared to the individual module performances. Experiments on a 3D stereovision database AVOZES show that, at severe levels of audio and video mismatch, the audio, mouth, 3D face, and tri-module (audio+mouth+3D face) fusion EERs were 42.9%, 32%, 15%, and 7.3% respectively for biometric speaker identity verification application. I.
Keywords :
Additive white noise; Biometrics; Degradation; Mouth; Noise robustness; Performance evaluation; Speech enhancement; Testing; Transform coding; Video compression;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Image Computing Techniques and Applications, 9th Biennial Conference of the Australian Pattern Recognition Society on
Conference_Location :
Glenelg, Australia
Print_ISBN :
0-7695-3067-2
Type :
conf
DOI :
10.1109/DICTA.2007.4426826
Filename :
4426826
Link To Document :
بازگشت