DocumentCode :
3022057
Title :
A Multimodality Framework for Creating Speaker/Non-Speaker Profile Databases for Real-World Video
Author :
Abbas, Jehanzeb ; Dagli, Charlie K. ; Huang, Thomas S.
Author_Institution :
Univ. of Illinois at Urbana-Champaign, Urbana
fYear :
2007
fDate :
17-22 June 2007
Firstpage :
1
Lastpage :
8
Abstract :
We propose a complete solution to full modality person-profiling for speakers and submodality person-profiling for non-speakers in real-world videos. This is a step towards building an elaborate database efface, name and voice correspondence for speakers appearing in the news videos. In addition we are also interested in only name and face correspondence database for non-speakers who appear during voice-overs. We use an unsupervised technique for creating a speaker identification database and a unique primary feature matching and parallel line matching algorithm for creating a non-speaker identification database. We tested our approach on real world data and the results show good performance for news videos. It can be incorporated as part of a larger multimedia news video analysis system or a multimedia search system for efficient news video retrieval and browsing.
Keywords :
audio databases; speaker recognition; video databases; video retrieval; face correspondence database; feature matching; full modality person-profiling; multimedia news video analysis system; multimedia search system; multimodality framework; news video retrieval; nonspeaker identification database; parallel line matching algorithm; real-world video; speaker identification database; speaker-nonspeaker profile databases; submodality person-profiling; unsupervised technique; Automatic speech recognition; Digital multimedia broadcasting; Face detection; Information analysis; Multimedia communication; Multimedia databases; Multimedia systems; Radio broadcasting; Spatial databases; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
Conference_Location :
Minneapolis, MN
ISSN :
1063-6919
Print_ISBN :
1-4244-1179-3
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2007.383493
Filename :
4270491
Link To Document :
بازگشت