• DocumentCode
    12774
  • Title

    Unsupervised Celebrity Face Naming in Web Videos

  • Author

    Lei Pang ; Chong-Wah Ngo

  • Author_Institution
    Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China
  • Volume
    17
  • Issue
    6
  • fYear
    2015
  • fDate
    Jun-15
  • Firstpage
    854
  • Lastpage
    866
  • Abstract
    This paper investigates the problem of celebrity face naming in unconstrained videos with user-provided metadata . Instead of relying on accurate face labels for supervised learning, a rich set of relationships automatically derived from video content and knowledge from image domain and social cues is leveraged for unsupervised face labeling. The relationships refer to the appearances of faces under different spatio-temporal contexts and their visual similarities. The knowledge includes Web images weakly tagged with celebrity names and the celebrity social networks. The relationships and knowledge are elegantly encoded using conditional random field (CRF) for label inference. Two versions of face annotation are considered: within-video and between-video face labeling. The former addresses the problem of incomplete and noisy labels in metadata, where null assignment of names is allowed-a problem seldom been considered in the literature. The latter further rectifies the errors in metadata, specifically to correct false labels and annotate faces with missing names in the metadata of a video, by considering a group of socially connected videos for joint label inference. Experimental results on a large archive of Web videos show the robustness of the proposed approach in dealing with the problems of missing and false labels, leading to higher accuracy in face labeling than several existing approaches but with minor degradation in speed efficiency.
  • Keywords
    Internet; face recognition; meta data; random processes; social networking (online); unsupervised learning; video signal processing; CRF; Web images; Web videos; between-video face labeling; celebrity social networks; conditional random field; false labels; image domain; joint label inference; missing labels; null assignment; social cues; socially connected videos; spatio-temporal contexts; unconstrained videos; unsupervised celebrity face naming; unsupervised face labeling; user-provided metadata; video content; visual similarities; within-video face labeling; Face; Labeling; Mathematical model; Social network services; Training; Videos; Visualization; Celebrity face naming; social network; unconstrained web videos; unsupervised;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2015.2419452
  • Filename
    7078858