DocumentCode
178102
Title
Boosted Multi-modal Supervised Latent Dirichlet Allocation for Social Event Classification
Author
Shengsheng Qian ; Tianzhu Zhang ; Changsheng Xu
Author_Institution
Inst. of Autom., Beijing, China
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
1999
Lastpage
2004
Abstract
With the rapidly increasing popularity of Social Media sites (e.g., Flickr, YouTube, and Facebook), it is convenient for users to share their own comments on many social events, which successfully facilitates social event generation, sharing and propagation and results in a large amount of user-contributed media data (e.g., images, videos, and texts) for a wide variety of real-world events of different types and scales. As a consequence, it has become more and more difficult to find exactly the interesting events from massive social media data, which is useful to browse, search and monitor social events by users or governments. To deal with these issues, we propose a novel boosted multi-modal supervised Latent Dirichlet Allocation (BMM-SLDA) for social event classification. Our BMM-SLDA has a number of advantages. (1) It can effectively exploit the multi-modality and the supervised information of social events jointly. (2) It is suitable to large-scale data analysis by utilizing boosting weighted sampling strategy to iteratively select a small subset data to efficiently train the corresponding topic models. (3) It effectively exploits boosting document weight distribution by classification error, and can iteratively learn new topic model to correct the previously misclassified documents. We evaluate our BMM-SLDA on a real-world dataset and show extensive results, which show that our model outperforms state-of-the-art methods.
Keywords
data analysis; social networking (online); BMM-SLDA; Facebook; Flickr; YouTube; boosted multimodal supervised latent Dirichlet allocation; classification error; document weight distribution; large-scale data analysis; massive social media data; real-world dataset; social event classification; social media sites; social propagation; social sharing; topic models; user-contributed media data; weighted sampling strategy; Analytical models; Boosting; Media; Resource management; Training; Training data; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.349
Filename
6977061
Link To Document