Author_Institution :
IRIS, Inst. for Language & Speech Process., Athens, Greece
Abstract :
REVEAL THIS addresses a basic need underlying content organisation, filtering, consumption and enjoyment by developing content programming systems that will help European citizens keep up with the explosion of digital content scattered over different platforms (radio, TV, World Wide Web, etc), different media (speech, text, image, video) and different languages. REVEAL THIS aims at developing content programming technology able to capture, semantically index, categorize and cross-link multiplatform, multimedia and multilingual digital content, as well as provide the system user with semantic search, retrieval, summarization and translation functionalities. The innovative aspects of the project spring out of the main scientific and technological challenges: 1) semantic enrichment of multilingual multimedia content with topic, entity and fact information relevant to user profiles; 2) development of suitable cross-language, cross-media representations; and 3) deployment of the above in building search, retrieval, classification and summarization capabilities. We exploit explicit cross-media links and implicit links uncovered by methods such as latent semantic analysis and kernel canonical correlation analysis. Adequate cross-language capabilities (cross-language information retrieval, categorization and machine translation of indicative summaries) will be provided by the latest statistical machine translation technology.
Keywords :
content-based retrieval; language translation; multimedia systems; statistical analysis; text analysis; content programming system; cross-language information retrieval; cross-link multiplatform; cross-media representation; information society; kernel canonical correlation analysis; latent semantic analysis; multilingual digital content; multimedia multilingual content retrieval; semantic retrieval; semantic search; semantic summarization; statistical machine translation technology; translation functionality;
Conference_Titel :
Integration of Knowledge, Semantics and Digital Media Technology, 2005. EWIMT 2005. The 2nd European Workshop on the (Ref. No. 2005/11099)