Title :
Dynamic Seed Analysis in a Social Network for Maximizing Efficiency of Data Collection
Author :
Changhyun Byun ; Hyeoncheol Lee ; Jongsung You ; Yanggon Kim
Author_Institution :
Dept. of Comput. & Inf. Sci., Towson Univ., Towson, MD, USA
Abstract :
Applying data mining techniques to social media can yield interesting perspectives to understanding individual and human behavior, detecting hot issues and topics, or discovering a group and community. However, it is difficult to gather the data related to a specific topic due to the main characteristics of social media data sets: data is large, noisy, and dynamic. To collect the data related to a specific topic and keyword efficiently, we propose a new algorithm that selects the best seed nodes with limited resources and time. The algorithm also evaluates various user influence and activity factors, and updates the seed nodes dynamically during the gathering process. Furthermore, we compare two data sets collected by the algorithm and existing approaches.
Keywords :
data mining; human factors; social networking (online); activity factor evaluation; data collection efficiency maximization; data gathering; data mining techniques; dynamic seed analysis; dynamic seed node update; human behavior; individual behavior; large-noisy-dynamic data collection; seed node selection; social media data sets; social network; user influence evaluation; Algorithm design and analysis; Data mining; Heuristic algorithms; Knowledge discovery; Media; Twitter; social networks; twitter; seed analysis; initial nodes; crawling; presidential election 2012;;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2013 14th ACIS International Conference on
Conference_Location :
Honolulu, HI
DOI :
10.1109/SNPD.2013.45