Title :
Identification of coreferential chains in video texts for semantic annotation of news videos
Author :
Küçük, Dilek ; Yazici, Adnan
Author_Institution :
Power Electron. Group, Uzay Inst., Ankara
Abstract :
People can benefit from todaypsilas video archives of huge sizes only through appropriate and effective ways of querying the video data. In order to query the video data through high-level semantic entities such as objects, events, and relations, these entities should be properly extracted and the corresponding video shots should be annotated accordingly. Video texts, which comprise the caption texts on the frames as well as transcription texts obtained through automatic speech recognition techniques, are valuable sources of information for semantic modeling of the videos. In this paper, we present an approach for the extraction of semantic objects from videos by utilizing lexical resources along with the identification of coreference chains in the corresponding video texts. Coreference is a phenomenon in natural language texts where a number of entities in the text refer to the same real world entity. Therefore, while the domain-specific lexical resources aid in the determination of salient entities in the video text, the identification of coreference chains prevents the superfluous extraction of the same underlying entities due to their different surface forms in the video texts. The proposed approach is significant for its being the first attempt to address the importance of coreference phenomenon in video texts for precise entity extraction during the semantic modeling of news videos with a hands-on application. The approach has been evaluated on Turkish political news texts from the METU Turkish corpus and a number of evaluation problems faced such as sparseness of annotated evaluation data for Turkish are also pointed out together with further research directions to pursue.
Keywords :
feature extraction; query processing; speech recognition; text analysis; video signal processing; automatic speech recognition techniques; domain-specific lexical resources; natural language texts; news videos; semantic annotation; transcription texts; video text coreferential chains; Automatic speech recognition; Broadcasting; Data engineering; Data mining; Data models; Information resources; Natural languages; Power electronics; Power engineering and energy; Power engineering computing;
Conference_Titel :
Computer and Information Sciences, 2008. ISCIS '08. 23rd International Symposium on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-2880-9
Electronic_ISBN :
978-1-4244-2881-6
DOI :
10.1109/ISCIS.2008.4717886