DocumentCode :
3727184
Title :
A robust algorithm for determining the newsworthiness of microblogs
Author :
P. K. K. Madhawa;Ajantha S. Atukorale
Author_Institution :
University of Colombo School of Computing, No 35, Reid Avenue, 00700, Sri Lanka
fYear :
2015
Firstpage :
135
Lastpage :
139
Abstract :
Microblogging platforms such as Twitter have become a primary medium for people to share their experiences and opinions on a broad range of topics. Because posts on Twitter are publicly viewable by default, Twitter can be used to obtain up-to-date information on events such as natural disasters, disease outbreaks, or sports events. Building a cohesive summary from tweets about long-running events is a problem of active interest to the research community. However, the abundance of tweets containing user opinions and sentiments towards a topic makes it necessary to extract the newsworthy tweets from a large stream of tweets on a single topic. Most existing methods for this task require large hand-labeled corpora for training the model, which is not practical for a rapidly updating medium like Twitter. In this paper, we address this problem by introducing a novel heuristic-based annotation scheme to generate the training dataset for the system. A hand-labeled corpus of tweets is used only for benchmarking the objectivity classifier. Our classifier achieves an F1-score of 80% on a manually annotated gold-standard dataset.
Keywords :
"Support vector machines","Irrigation","Decision support systems","Training","Gold","Standards","Logistics"
Publisher :
IEEE
Conference_Title :
Advances in ICT for Emerging Regions (ICTer), 2015 Fifteenth International Conference on
Print_ISBN :
978-1-4673-9440-6
Type :
conf
DOI :
10.1109/ICTER.2015.7377679
Filename :
7377679