• DocumentCode
    3673946
  • Title

    Latent max-margin metric learning for comparing video face tubes

  • Author

    Gaurav Sharma;Patrick Pérez

  • Author_Institution
    MPI Informatics, Germany
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    65
  • Lastpage
    74
  • Abstract
    Comparing “face tubes” is a key component of modern systems for face biometrics based video analysis and annotation. We present a novel algorithm to learn a distance metric between such spatio-temporal face tubes in videos. The main novelty in the algorithm is based on incorporation of latent variables in a max-margin metric learning framework. The latent formulation allows us to model, and learn metrics to compare faces under different challenging variations in pose, expressions and lighting. We propose a novel dataset named TV Series Face Tubes (TSFT) for evaluating the task. The dataset is collected from 12 different episodes of 8 popular TV series and has 94 subjects with 569 manually annotated face tracks in total. We show quantitatively how incorporating latent variables in max-margin metric learning leads to improvement of current state-of-the-art metric learning methods for the two cases when the testing is done with subjects that were seen during training and when the test subjects were not seen at all during training. We also give results on a challenging benchmark dataset: YouTube faces, and place our algorithm in context w.r.t. existing methods.
  • Keywords
    "Measurement","Lighting","Estimation","Electron tubes"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition Workshops (CVPRW), 2015 IEEE Conference on
  • Electronic_ISBN
    2160-7516
  • Type

    conf

  • DOI
    10.1109/CVPRW.2015.7301321
  • Filename
    7301321