Using the visual Words based on Affine-SIFT descriptors for face recognition

Author

Yu-Shan Wu ; Heng-Sung Liu ; Gwo-Hwa Ju ; Ting-Wei Lee ; Yen-Lin Chiu

Author_Institution

Chunghwa Telecommun. Labs., Taoyuan, Taiwan

fYear

2012

fDate

3-6 Dec. 2012

Firstpage

1

Lastpage

5

Abstract

Video-based face recognition has drawn a lot of attention in recent years. On the other hand, Bag-of-visual Words (BoWs) representation has been successfully applied in image retrieval and object recognition recently. In this paper, a video-based face recognition approach which uses visual words is proposed. In classic visual words, Scale Invariant Feature Transform (SIFT) descriptors of an image are firstly extracted on interest points detected by difference of Gaussian (DoG), then k-means-based visual vocabulary generation is applied to replace these descriptors with the indexes of the closet visual words. However, in facial images, SIFT descriptors are not good enough due to facial pose distortion, facial expression and lighting condition variation. In this paper, we use Affine-SIFT (ASIFT) descriptors as facial image representation. Experimental results on UCSD/Honda Video Database and VidTIMIT Video Database suggest that visual words based on Affine-SIFT descriptors can achieve lower error rates in face recognition task.

Keywords

Gaussian processes; face recognition; image representation; image retrieval; object recognition; transforms; video signal processing; ASIFT descriptors; BoW representation; DoG; UCSD-Honda video database; VidTIMIT video database; affine-scale invariant feature transform descriptors; bag-of-visual word representation; difference of Gaussian; facial expression; facial image representation; facial images; facial pose distortion; image retrieval; k-means-based visual vocabulary generation; lighting condition variation; object recognition; video-based face recognition; Face; Face recognition; Transforms; Video sequences; Visual databases; Visualization; Affine-SIFT; SIFT; face recognition; visual words;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific

Conference_Location

Hollywood, CA

Print_ISBN

978-1-4673-4863-8

Type

conf

Filename

6412022