DocumentCode :
1884314
Title :
Predicting Flu Trends using Twitter data
Author :
Achrekar, Harshavardhan ; Gandhe, Avinash ; Lazarus, Ross ; Yu, Ssu-Hsin ; Liu, Benyuan
Author_Institution :
Dept. of Comput. Sci., Univ. of Massachusetts Lowell, Lowell, MA, USA
fYear :
2011
fDate :
10-15 April 2011
Firstpage :
702
Lastpage :
707
Abstract :
Reducing the impact of seasonal influenza epidemics and other pandemics such as the H1N1 is of paramount importance for public health authorities. Studies have shown that effective interventions can be taken to contain the epidemics if early detection can be made. Traditional approach employed by the Centers for Disease Control and Prevention (CDC) includes collecting influenza-like illness (ILI) activity data from “sentinel” medical practices. Typically there is a 1-2 week delay between the time a patient is diagnosed and the moment that data point becomes available in aggregate ILI reports. In this paper we present the Social Network Enabled Flu Trends (SNEFT) framework, which monitors messages posted on Twitter with a mention of flu indicators to track and predict the emergence and spread of an influenza epidemic in a population. Based on the data collected during 2009 and 2010, we find that the volume of flu related tweets is highly correlated with the number of ILI cases reported by CDC. We further devise auto-regression models to predict the ILI activity level in a population. The models predict data collected and published by CDC, as the percentage of visits to “sentinel” physicians attributable to ILI in successively weeks. We test models with previous CDC data, with and without measures of Twitter data, showing that Twitter data can substantially improve the models prediction accuracy. Therefore, Twitter data provides real-time assessment of ILI activity.
Keywords :
data handling; diseases; medical computing; regression analysis; social networking (online); Centers for Disease Control and Prevention; H1N1; ILI activity; Twitter data; autoregression models; flu trend prediction; influenza epidemics; influenza-like illness activity data; public health authorities; social network enabled flu trends framework; Correlation; Data models; Delay; Medical services; Predictive models; Real time systems; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Communications Workshops (INFOCOM WKSHPS), 2011 IEEE Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4577-0249-5
Electronic_ISBN :
978-1-4577-0248-8
Type :
conf
DOI :
10.1109/INFCOMW.2011.5928903
Filename :
5928903
Link To Document :
بازگشت