Title : 
Predicting Flu Trends using Twitter data
         
        
            Author : 
Achrekar, Harshavardhan ; Gandhe, Avinash ; Lazarus, Ross ; Yu, Ssu-Hsin ; Liu, Benyuan
         
        
            Author_Institution : 
Dept. of Comput. Sci., Univ. of Massachusetts Lowell, Lowell, MA, USA
         
        
        
        
        
        
            Abstract : 
Reducing the impact of seasonal influenza epidemics and other pandemics such as the H1N1 is of paramount importance for public health authorities. Studies have shown that effective interventions can be taken to contain the epidemics if early detection can be made. Traditional approach employed by the Centers for Disease Control and Prevention (CDC) includes collecting influenza-like illness (ILI) activity data from “sentinel” medical practices. Typically there is a 1-2 week delay between the time a patient is diagnosed and the moment that data point becomes available in aggregate ILI reports. In this paper we present the Social Network Enabled Flu Trends (SNEFT) framework, which monitors messages posted on Twitter with a mention of flu indicators to track and predict the emergence and spread of an influenza epidemic in a population. Based on the data collected during 2009 and 2010, we find that the volume of flu related tweets is highly correlated with the number of ILI cases reported by CDC. We further devise auto-regression models to predict the ILI activity level in a population. The models predict data collected and published by CDC, as the percentage of visits to “sentinel” physicians attributable to ILI in successively weeks. We test models with previous CDC data, with and without measures of Twitter data, showing that Twitter data can substantially improve the models prediction accuracy. Therefore, Twitter data provides real-time assessment of ILI activity.
         
        
            Keywords : 
data handling; diseases; medical computing; regression analysis; social networking (online); Centers for Disease Control and Prevention; H1N1; ILI activity; Twitter data; autoregression models; flu trend prediction; influenza epidemics; influenza-like illness activity data; public health authorities; social network enabled flu trends framework; Correlation; Data models; Delay; Medical services; Predictive models; Real time systems; Twitter;
         
        
        
        
            Conference_Titel : 
Computer Communications Workshops (INFOCOM WKSHPS), 2011 IEEE Conference on
         
        
            Conference_Location : 
Shanghai
         
        
            Print_ISBN : 
978-1-4577-0249-5
         
        
            Electronic_ISBN : 
978-1-4577-0248-8
         
        
        
            DOI : 
10.1109/INFCOMW.2011.5928903