Title :
Audio-Guided Video-Based Face Recognition
Author :
Tang, Xiaoou ; Li, Zhifeng
Author_Institution :
Dept. of Inf. Eng., Chinese Univ. of Hong Kong (CUHK), Hong Kong, China
fDate :
7/1/2009
Abstract :
In this paper, we develop a new video-to-video face recognition algorithm. The major advantage of the video-based method is that more information is available in a video sequence than in a single image. To exploit the large amount of information in a video sequence while overcoming the processing speed and data size problems, we develop several new techniques for video sequence processing, including temporal and spatial frame synchronization, multilevel discriminant subspace analysis, and multiclassifier integration. An aligned video sequence for each person is first obtained by applying temporal and spatial synchronization, which effectively establishes the face correspondence using both audio and video information; multilevel discriminant subspace analysis or multiclassifier integration is then applied to the synchronized sequence. The method preserves most of the temporal-spatial information contained in a video sequence. Extensive experiments on the XM2VTS database clearly show the superiority of our new algorithms, which obtain near-perfect classification results (99.3%).
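The multiclassifier integration step can be illustrated with a minimal sketch. It assumes that each video has already been synchronized to the same number of frames, that each frame is reduced to a feature vector, and that the per-frame decisions are fused by majority vote. The nearest-mean classifier is a simple stand-in for a discriminant subspace classifier, and all function names and the toy data are hypothetical; the abstract does not specify the actual classifiers or fusion rule.

```python
from collections import Counter


def class_means(samples):
    """samples: dict label -> list of feature vectors. Returns label -> mean vector."""
    means = {}
    for label, vectors in samples.items():
        dim = len(vectors[0])
        means[label] = [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]
    return means


def nearest_mean(means, x):
    """Label whose mean is closest to x (squared Euclidean distance)."""
    return min(
        means,
        key=lambda label: sum((a - b) ** 2 for a, b in zip(means[label], x)),
    )


def train_frame_classifiers(gallery):
    """gallery: dict label -> list of videos; each video is a list of K frame vectors.

    Builds one nearest-mean classifier per synchronized frame position
    (assumption: frame k of every video shows a corresponding face state).
    """
    num_frames = len(next(iter(gallery.values()))[0])
    classifiers = []
    for k in range(num_frames):
        samples = {
            label: [video[k] for video in videos]
            for label, videos in gallery.items()
        }
        classifiers.append(class_means(samples))
    return classifiers


def classify_video(classifiers, probe):
    """Fuse per-frame decisions by majority vote (assumed fusion rule).

    Ties are resolved in favor of the label that was voted for first.
    """
    votes = [nearest_mean(means, frame) for means, frame in zip(classifiers, probe)]
    return Counter(votes).most_common(1)[0][0]


# Toy data: two people, one gallery video each, three synchronized frames.
gallery = {
    "alice": [[[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]],
    "bob": [[[5.0, 5.0], [6.0, 5.0], [5.0, 6.0]]],
}
classifiers = train_frame_classifiers(gallery)

# The middle frame of the probe is corrupted and resembles "bob".
probe = [[0.2, 0.1], [5.8, 5.1], [0.1, 0.9]]
result = classify_video(classifiers, probe)
print(result)
```

In this toy case two of the three frame classifiers vote for "alice", so a single misleading frame does not change the video-level decision.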
Keywords :
audio signal processing; face recognition; image classification; matrix algebra; principal component analysis; speech recognition; video signal processing; LDA; PCA; XM2VTS database; audio-guided video-to-video face recognition algorithm; between-class scatter matrix; face correspondence; multiclassifier integration; multilevel discriminant subspace analysis; spatial frame synchronization; temporal synchronization; video sequence processing; subspace analysis; video processing
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2009.2022694