Title :
Multimedia Summarization for Social Events in Microblog Stream
Author :
Jingwen Bian ; Yang Yang ; Hanwang Zhang ; Tat-Seng Chua
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
Microblogging services have revolutionized the way people exchange information. Confronted with the ever-increasing numbers of social events and the corresponding microblogs with multimedia contents, it is desirable to provide visualized summaries to help users to quickly grasp the essence of these social events for better understanding. While existing approaches mostly focus only on text-based summary, microblog summarization with multiple media types (e.g., text, image, and video) is scarcely explored. In this paper, we propose a multimedia social event summarization framework to automatically generate visualized summaries from the microblog stream of multiple media types. Specifically, the proposed framework comprises three stages, as follows. 1) A noise removal approach is first devised to eliminate potentially noisy images. An effective spectral filtering model is exploited to estimate the probability that an image is relevant to a given event. 2) A novel cross-media probabilistic model, termed Cross-Media-LDA (CMLDA), is proposed to jointly discover subevents from microblogs of multiple media types. The intrinsic correlations among these different media types are well explored and exploited for reinforcing the cross-media subevent discovery process. 3) Finally, based on the cross-media knowledge of all the discovered subevents, a multimedia microblog summary generation process is designed to jointly identify both representative textual and visual samples, which are further aggregated to form a holistic visualized summary. We conduct extensive experiments on two real-world microblog datasets to demonstrate the superiority of the proposed framework as compared to the state-of-the-art approaches.
Keywords :
Web sites; data visualisation; document handling; multimedia computing; probability; CMLDA; Cross-Media-LDA; cross-media knowledge; cross-media probabilistic model; cross-media subevent discovery process; holistic visualized summary; image relevance probability estimation; information exchange; media type correlation; microblog dataset; microblog stream; microblog summarization; microblogging services; multimedia content; multimedia microblog summary generation process; multimedia social event summarization framework; multiple media types; noise removal approach; potentially noisy image elimination; representative textual samples; representative visual samples; spectral filtering model; visualized summaries; Feature extraction; Media; Multimedia communication; Noise measurement; Semantics; Streaming media; Visualization; Microblog; multimedia summarization; social event;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2014.2384912