Title :
Estimation of Speaking Speed for Faster Face Detection in Video-Footage
Author_Institution :
Fac. of Eng., Takushoku Univ., Tokyo
Abstract :
We previously reported a face detection system based on color segmentation using HSV. HSV was shown to be more effective than other color representations, both for accurate segmentation and for effective extraction of facial features. The former is crucial for detection and the latter for recognition. In video footage of news programs, sound usually accompanies the video, and people express themselves by moving facial parts while speaking. In this paper, we improve the speed of face detection by using sound and video in combination. First, the rate of spoken syllables is estimated from the sound. Next, for a short video clip at the beginning of each new scene, a differential image is formed with a frame distance corresponding to that rate in order to locate the mouth and eyes. This greatly reduces the number of sampling points for segmentation and enhances the reliability of the detection. The estimation also discriminates music from speech. Together, these contribute to much faster face detection.
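The two steps named in the abstract (estimating a syllable rate from the sound, then forming a differential image at a frame distance tied to that rate) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the estimation method or the rate-to-distance mapping. The sketch assumes that syllables can be counted as peaks of a short-time energy envelope and that the frame distance is half a syllable period. All function names, the 20 ms frame length, and the 0.5 peak threshold are hypothetical.

```python
import numpy as np


def estimate_syllable_rate(signal, sample_rate, frame_ms=20.0):
    """Rough syllables-per-second estimate (assumed method: energy-envelope peaks)."""
    frame_len = int(sample_rate * frame_ms / 1000.0)
    n_frames = len(signal) // frame_len
    samples = np.asarray(signal[: n_frames * frame_len], dtype=float)
    # Short-time energy: mean squared amplitude of each frame.
    energy = (samples.reshape(n_frames, frame_len) ** 2).mean(axis=1)
    # Smooth the envelope with a 3-tap moving average.
    env = np.convolve(energy, np.ones(3) / 3.0, mode="same")
    # Count local maxima above half of the global maximum (assumed threshold).
    mid = env[1:-1]
    peaks = (mid > env[:-2]) & (mid >= env[2:]) & (mid > 0.5 * env.max())
    duration_s = n_frames * frame_len / sample_rate
    return peaks.sum() / duration_s


def frame_distance_for_rate(syllable_rate, fps):
    """Frame distance spanning half a syllable period (an assumed mapping)."""
    return max(1, int(round(fps / (2.0 * syllable_rate))))


def differential_image(frames, distance):
    """Absolute difference between the first frame and the frame `distance` later."""
    first = frames[0].astype(np.int32)
    later = frames[distance].astype(np.int32)
    return np.abs(later - first).astype(np.uint8)


if __name__ == "__main__":
    # Synthetic "speech": a 200 Hz tone amplitude-modulated at 4 Hz for 2 s.
    sr = 8000
    t = np.arange(2 * sr) / sr
    audio = 0.5 * (1 - np.cos(2 * np.pi * 4 * t)) * np.sin(2 * np.pi * 200 * t)
    rate = estimate_syllable_rate(audio, sr)
    dist = frame_distance_for_rate(rate, fps=30.0)

    # Synthetic clip: a small patch (standing in for the mouth) changes over time.
    clip = np.zeros((8, 16, 16), dtype=np.uint8)
    clip[dist:, 6:10, 6:10] = 200
    diff = differential_image(clip, dist)
    print(rate, dist, int(diff.max()))
```

In a pipeline of this kind, pixels with large values in the differential image would mark moving regions such as the mouth and eyes, so color segmentation could be sampled only around them rather than over the whole frame.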
Keywords :
face recognition; feature extraction; image colour analysis; image sampling; image segmentation; video retrieval; HSV; color segmentation; facial feature extraction; facial recognition; faster face detection; sampling point; speaking speed estimation; video clip; video-footage; Eyes; Face detection; Face recognition; Facial features; Image retrieval; Image sampling; Image segmentation; Layout; Mouth; Speech analysis;
Conference_Title :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
0-7803-9331-7
DOI :
10.1109/ICME.2005.1521455