Title :
Large-scale socio-demographic pattern discovery on microblog metadata
Author :
Cheong, Minho ; Ray, Sambaran ; Green, Dale
Author_Institution :
Clayton Sch. of IT, Monash Univ., Clayton, VIC, Australia
Abstract :
Microblogging services, such as Twitter, generate huge volumes of data reflecting the current zeitgeist. As such they are of enormous potential value to studies ranging from data mining to social anthropology. To realize the potential, this study investigates improvements of algorithms specifically tailored for the discovery of latent socio-demographic patterns in Twitter metadata. These newly improved hybrid algorithms improve on existing ones in terms of speed and scalability (from thousands of records to millions). Testing on a real-world Twitter data set (~7.4 million messages) reveals emergent patterns in global day-to-day Twitter activity. The results demonstrate novel insight when applied to real-world Twitter data, practical large-scale applications of the methods, and suggest potential areas of future research.
Keywords :
Internet; data mining; meta data; social aspects of automation; social networking (online); Microblogging services; Twitter metadata; data mining; large scale sociodemographic pattern discovery; microblog metadata; social anthropology; Accuracy; Educational institutions; Inference algorithms; Mobile communication; Software; Twitter; Twitter; demographics; message metadata; microblogging; social networking; user metadata;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2012 12th International Conference on
Conference_Location :
Kochi
Print_ISBN :
978-1-4673-5117-1
DOI :
10.1109/ISDA.2012.6416659