• DocumentCode
    2695157
  • Title

    Automatic character identification in feature-length films

  • Author

    Zhang, Yi-Fan ; Xu, Changsheng ; Lu, Hanqing

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing
  • fYear
    2008
  • fDate
    June 23 2008-April 26 2008
  • Firstpage
    1469
  • Lastpage
    1472
  • Abstract
    This paper presents a novel approach to automatically identify characters in films using audio visual cues and text analysis. The approach consists of three stages: (i) frontal face track detection and clustering, (ii) face track classification, (iii) name assignment. A finite state machine (FSM) method is utilized to filter faces detected on each frame and build face tracks. The face tracks are clustered using constrained K-centers. The tracks located in the center area of each cluster are set as exemplars. The marginal points of each cluster and the newly detected non-frontal face tracks are classified to these exemplars using complementary cues of audio and visual. The names of characters are ranked based on their occurrences in the film script and the face track clusters are ranked based on track counts. The names are assigned to the clusters according to the ranking order. Experiments were conducted on two feature-length films and gave promising results.
  • Keywords
    character recognition; face recognition; finite state machines; image classification; audio visual cues; automatic character identification; constrained K-centers; face track classification; feature-length films; finite state machine method; frontal face track detection; name assignment; text analysis; Automata; Automation; Conductive films; Face detection; Face recognition; Filters; Motion pictures; Pattern recognition; Scattering; Text analysis; face recognition; movie analysis; speaker identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2008 IEEE International Conference on
  • Conference_Location
    Hannover
  • Print_ISBN
    978-1-4244-2570-9
  • Electronic_ISBN
    978-1-4244-2571-6
  • Type

    conf

  • DOI
    10.1109/ICME.2008.4607723
  • Filename
    4607723