• DocumentCode
    3022057
  • Title

    A Multimodality Framework for Creating Speaker/Non-Speaker Profile Databases for Real-World Video

  • Author

    Abbas, Jehanzeb ; Dagli, Charlie K. ; Huang, Thomas S.

  • Author_Institution
    Univ. of Illinois at Urbana-Champaign, Urbana
  • fYear
    2007
  • fDate
    17-22 June 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    We propose a complete solution to full modality person-profiling for speakers and submodality person-profiling for non-speakers in real-world videos. This is a step towards building an elaborate database efface, name and voice correspondence for speakers appearing in the news videos. In addition we are also interested in only name and face correspondence database for non-speakers who appear during voice-overs. We use an unsupervised technique for creating a speaker identification database and a unique primary feature matching and parallel line matching algorithm for creating a non-speaker identification database. We tested our approach on real world data and the results show good performance for news videos. It can be incorporated as part of a larger multimedia news video analysis system or a multimedia search system for efficient news video retrieval and browsing.
  • Keywords
    audio databases; speaker recognition; video databases; video retrieval; face correspondence database; feature matching; full modality person-profiling; multimedia news video analysis system; multimedia search system; multimodality framework; news video retrieval; nonspeaker identification database; parallel line matching algorithm; real-world video; speaker identification database; speaker-nonspeaker profile databases; submodality person-profiling; unsupervised technique; Automatic speech recognition; Digital multimedia broadcasting; Face detection; Information analysis; Multimedia communication; Multimedia databases; Multimedia systems; Radio broadcasting; Spatial databases; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1063-6919
  • Print_ISBN
    1-4244-1179-3
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2007.383493
  • Filename
    4270491