Title :
Extracting situational awareness from microblogs during disaster events
Author :
Sen, Anirban ; Rudra, Koustav ; Ghosh, Saptarshi
Author_Institution :
Dept. of Comput. Sci. & Technol., Indian Inst. of Eng. Sci. & Technol. Shibpur, Howrah, India
Abstract :
Microblogging sites such as Twitter and Weibo are increasingly being used to enhance situational awareness during various natural and man-made disaster events such as floods, earthquakes, and bomb blasts. During any such event, thousands of microblogs (tweets) are posted in short intervals of time. Typically, only a small fraction of these tweets contribute to situational awareness, while the majority merely reflect the sentiment or opinion of people. Real-time extraction of tweets that contribute to situational awareness is especially important for relief operations when time is critical. However, automatically differentiating such tweets from those that reflect opinion / sentiment is a non-trivial challenge, mainly because of the very small size of tweets and the informal way in which tweets are written (frequent use of emoticons, abbreviations, and so on). This study applies Natural Language Processing (NLP) techniques to address this challenge. We extract low-level syntactic features from the text of tweets, such as the presence of specific types of words and parts-of-speech, to develop a classifier to distinguish between tweets which contribute to situational awareness and tweets which do not. Experiments over tweets related to four diverse disaster events show that the proposed features identify situational awareness tweets with significantly higher accuracy than classifiers based on standard bag-of-words models.
Keywords :
emergency management; natural language processing; pattern classification; social networking (online); text analysis; NLP techniques; Twitter; Weibo; bomb blasts; classifier accuracy; earthquakes; floods; low-level syntactic feature extraction; man-made disaster events; microblogging sites; natural disaster; natural language processing; parts-of-speech; real-time tweet extraction; relief operations; situational awareness enhancement; situational awareness extraction; tweet text; Accuracy; Data mining; Feature extraction; Pragmatics; Speech; Speech enhancement; Training;
Conference_Titel :
Communication Systems and Networks (COMSNETS), 2015 7th International Conference on
Conference_Location :
Bangalore
DOI :
10.1109/COMSNETS.2015.7098720