DocumentCode
151877
Title
Automatic identification of alcohol-related promotions on Twitter and prediction of promotion spread
Author
Menon, Ashok ; Farmer, Fallon ; Whalen, Thomas ; Beini Hua ; Najib, Kareem ; Gerber, Mariana
Author_Institution
Dept. of Syst. & Inf. Eng., Univ. of Virginia, Charlottesville, VA, USA
fYear
2014
fDate
25-25 April 2014
Firstpage
233
Lastpage
238
Abstract
Teens who have viewed alcohol-related content on social networking sites are more likely to have consumed alcohol than teens that have not seen such content. This suggests a rising concern about the influence of these sites on adolescent drinking behavior. Parents, health organizations, and school administrators need a deeper understanding of online promotional patterns in order to combat risky behaviors through intervention and education. To address these problems, we developed a system that automatically identifies alcohol promotions in online Twitter content. The identification of promotions was modeled using supervised machine learning algorithms. Predictor variables were derived from the content of tweets, the Twitter meta-data, and the network structure. We evaluated this system using held-out testing data in a cross-validated experimental design. We found that random forest models were best at predicting promotional tweets. Yet, logistic regression main effects models were useful in determining the significance of each variable, both Twitter specific and textual. For Twitter specific variables, number of hashtags and number of mentions significantly increased the likelihood of a tweet being a promotion. Using the TF-IDF method for textual predictors, we found that words that describe a type of alcohol, such as “beer” or “wine,” increased the likelihood of a tweet being a promotion. Our analysis provides information about the current state of online alcohol promotion, salient characteristics of promotions and promoters, and the influence of promotions on other users of social networking sites.
Keywords
behavioural sciences; beverages; design of experiments; learning (artificial intelligence); meta data; promotion (marketing); random processes; regression analysis; social networking (online); TF-IDF method; Twitter meta-data; Twitter specific variables; adolescent drinking behavior; alcohol type; alcohol-related content; alcohol-related promotions; automatic promotions identification; beer; cross-validated experimental design; education; hashtags; held-out testing data; intervention; logistic regression; network structure; online Twitter content; online alcohol promotion; online promotional patterns; predictor variables; promotion spread prediction; promotional tweets; random forest models; risky behaviors; social networking sites; supervised machine learning algorithms; teens; textual predictors; wine; Alcoholic beverages; Data models; Logistics; Media; Predictive models; Training; Twitter; Predictive modeling; Social media; Substance use promotion; Text modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems and Information Engineering Design Symposium (SIEDS), 2014
Conference_Location
Charlottesville, VA
Print_ISBN
978-1-4799-4837-6
Type
conf
DOI
10.1109/SIEDS.2014.6829912
Filename
6829912
Link To Document