DocumentCode
3022057
Title
A Multimodality Framework for Creating Speaker/Non-Speaker Profile Databases for Real-World Video
Author
Abbas, Jehanzeb ; Dagli, Charlie K. ; Huang, Thomas S.
Author_Institution
Univ. of Illinois at Urbana-Champaign, Urbana
fYear
2007
fDate
17-22 June 2007
Firstpage
1
Lastpage
8
Abstract
We propose a complete solution to full modality person-profiling for speakers and submodality person-profiling for non-speakers in real-world videos. This is a step towards building an elaborate database efface, name and voice correspondence for speakers appearing in the news videos. In addition we are also interested in only name and face correspondence database for non-speakers who appear during voice-overs. We use an unsupervised technique for creating a speaker identification database and a unique primary feature matching and parallel line matching algorithm for creating a non-speaker identification database. We tested our approach on real world data and the results show good performance for news videos. It can be incorporated as part of a larger multimedia news video analysis system or a multimedia search system for efficient news video retrieval and browsing.
Keywords
audio databases; speaker recognition; video databases; video retrieval; face correspondence database; feature matching; full modality person-profiling; multimedia news video analysis system; multimedia search system; multimodality framework; news video retrieval; nonspeaker identification database; parallel line matching algorithm; real-world video; speaker identification database; speaker-nonspeaker profile databases; submodality person-profiling; unsupervised technique; Automatic speech recognition; Digital multimedia broadcasting; Face detection; Information analysis; Multimedia communication; Multimedia databases; Multimedia systems; Radio broadcasting; Spatial databases; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
Conference_Location
Minneapolis, MN
ISSN
1063-6919
Print_ISBN
1-4244-1179-3
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2007.383493
Filename
4270491
Link To Document