DocumentCode :
1880998
Title :
Audio-visual talking face detection
Author :
Li, Mingkun ; Li, Dongge ; Dimitrova, Nevenka ; Sethi, Ishwar
Author_Institution :
Intelligent Inf. Eng. Lab, Oakland Univ., Rochester, MI, USA
Volume :
2
fYear :
2003
fDate :
6-9 July 2003
Abstract :
Talking face detection is important for videoconferencing. However, the detection of the talking face is difficult because of the low resolution of the capturing devices, the informal style of communication and the background sounds. In this paper, we present a novel method for finding the talking face using latent semantic indexing approach. We tested our method on a comprehensive set of home video conferencing sessions with a very high detection rate. Our experiments show that the LSI method accuracy degrades gracefully in a noisy environment as opposed to the correlation method which simply fails in presence of noise.
Keywords :
audio-visual systems; face recognition; indexing; large scale integration; speech processing; speech recognition; teleconferencing; video signal processing; audio-visual talking face detection; face-speech matching; latent semantic indexing approach; videoconferencing; Acoustic noise; Correlation; Degradation; Face detection; Indexing; Large scale integration; Teleconferencing; Testing; Videoconference; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
Type :
conf
DOI :
10.1109/ICME.2003.1221656
Filename :
1221656
Link To Document :
بازگشت