DocumentCode
590875
Title
Using the visual Words based on Affine-SIFT descriptors for face recognition
Author
Yu-Shan Wu ; Heng-Sung Liu ; Gwo-Hwa Ju ; Ting-Wei Lee ; Yen-Lin Chiu
Author_Institution
Chunghwa Telecommun. Labs., Taoyuan, Taiwan
fYear
2012
fDate
3-6 Dec. 2012
Firstpage
1
Lastpage
5
Abstract
Video-based face recognition has drawn a lot of attention in recent years. On the other hand, Bag-of-visual Words (BoWs) representation has been successfully applied in image retrieval and object recognition recently. In this paper, a video-based face recognition approach which uses visual words is proposed. In classic visual words, Scale Invariant Feature Transform (SIFT) descriptors of an image are firstly extracted on interest points detected by difference of Gaussian (DoG), then k-means-based visual vocabulary generation is applied to replace these descriptors with the indexes of the closet visual words. However, in facial images, SIFT descriptors are not good enough due to facial pose distortion, facial expression and lighting condition variation. In this paper, we use Affine-SIFT (ASIFT) descriptors as facial image representation. Experimental results on UCSD/Honda Video Database and VidTIMIT Video Database suggest that visual words based on Affine-SIFT descriptors can achieve lower error rates in face recognition task.
Keywords
Gaussian processes; face recognition; image representation; image retrieval; object recognition; transforms; video signal processing; ASIFT descriptors; BoW representation; DoG; UCSD-Honda video database; VidTIMIT video database; affine-scale invariant feature transform descriptors; bag-of-visual word representation; difference of Gaussian; facial expression; facial image representation; facial images; facial pose distortion; image retrieval; k-means-based visual vocabulary generation; lighting condition variation; object recognition; video-based face recognition; Face; Face recognition; Transforms; Video sequences; Visual databases; Visualization; Affine-SIFT; SIFT; face recognition; visual words;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location
Hollywood, CA
Print_ISBN
978-1-4673-4863-8
Type
conf
Filename
6412022
Link To Document