• DocumentCode
    590875
  • Title

    Using the visual Words based on Affine-SIFT descriptors for face recognition

  • Author

    Yu-Shan Wu ; Heng-Sung Liu ; Gwo-Hwa Ju ; Ting-Wei Lee ; Yen-Lin Chiu

  • Author_Institution
    Chunghwa Telecommun. Labs., Taoyuan, Taiwan
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Video-based face recognition has drawn a lot of attention in recent years. On the other hand, Bag-of-visual Words (BoWs) representation has been successfully applied in image retrieval and object recognition recently. In this paper, a video-based face recognition approach which uses visual words is proposed. In classic visual words, Scale Invariant Feature Transform (SIFT) descriptors of an image are firstly extracted on interest points detected by difference of Gaussian (DoG), then k-means-based visual vocabulary generation is applied to replace these descriptors with the indexes of the closet visual words. However, in facial images, SIFT descriptors are not good enough due to facial pose distortion, facial expression and lighting condition variation. In this paper, we use Affine-SIFT (ASIFT) descriptors as facial image representation. Experimental results on UCSD/Honda Video Database and VidTIMIT Video Database suggest that visual words based on Affine-SIFT descriptors can achieve lower error rates in face recognition task.
  • Keywords
    Gaussian processes; face recognition; image representation; image retrieval; object recognition; transforms; video signal processing; ASIFT descriptors; BoW representation; DoG; UCSD-Honda video database; VidTIMIT video database; affine-scale invariant feature transform descriptors; bag-of-visual word representation; difference of Gaussian; facial expression; facial image representation; facial images; facial pose distortion; image retrieval; k-means-based visual vocabulary generation; lighting condition variation; object recognition; video-based face recognition; Face; Face recognition; Transforms; Video sequences; Visual databases; Visualization; Affine-SIFT; SIFT; face recognition; visual words;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
  • Conference_Location
    Hollywood, CA
  • Print_ISBN
    978-1-4673-4863-8
  • Type

    conf

  • Filename
    6412022