• DocumentCode
    114348
  • Title

    Multi-feature subspace analysis for audio-vidoe based multi-modal person recognition

  • Author

    Dihong Gong ; Na Li ; Zhifeng Li ; Yu Qiao

  • Author_Institution
    Shenzhen Inst. of Adv. Technol., Shenzhen, China
  • fYear
    2014
  • fDate
    26-28 April 2014
  • Firstpage
    776
  • Lastpage
    779
  • Abstract
    Biometric person recognition has received a lot of attention in recent years due to the growing security demands in commercial and law enforcement applications. However, using a single biometric has several problems. In order to alleviate these problems, multi-modal biometric systems are proposed by combining various biometric modalities to improve the robustness of person authentication. A typical application is to combine both audio and face for multimodal person recognition, since either face or voice is among the most natural biometrics that people use to identify each other. In this paper, a novel approach called multi-feature subspace analysis (MFSA) is proposed for audio-video based biometric person recognition. In the MFSA framework, each face sequence or utterance is represented with a fix-length feature vector, and then subspace analysis method is performed on a collection of random subspaces to construct an ensemble of classifiers for robust recognition. Experiments on the XM2VTSDB corpus sufficiently validate the feasibility and effectiveness of our new approach.
  • Keywords
    face recognition; feature extraction; random processes; speaker recognition; MFSA framework; XM2VTSDB corpus; audio-video based biometric person recognition; audio-video based multimodal person recognition; biometric person recognition; commercial applications; face sequence; fix-length feature vector; law enforcement applications; multifeature subspace analysis method; multimodal biometric systems; natural biometrics; random subspaces; robust recognition; security demands; utterance; Databases; Face; Face recognition; Feature extraction; Speech; Testing; Training; Audio-Video Based Person Recognition; Subspace Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Technology (ICIST), 2014 4th IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/ICIST.2014.6920592
  • Filename
    6920592