DocumentCode :
2503238
Title :
Mapping the Blogosphere with RSS-Feeds
Author :
Bross, Justus ; Quasthoff, Matthias ; Berger, Philipp ; Hennig, Patrick ; Meinel, Christoph
Author_Institution :
Hasso-Plattner Inst., Univ. of Potsdam, Potsdam, Germany
fYear :
2010
fDate :
20-23 April 2010
Firstpage :
453
Lastpage :
460
Abstract :
The massive adoption of social media has provided new ways for individuals to express their opinions online. The blogosphere, an inherent part of this trend, contains a vast array of information about a variety of topics. It is thus a huge think tank that creates an enormous and ever-changing archive of open source intelligence. Modeling and mining this vast pool of data to extract, exploit and describe meaningful knowledge in order to leverage (content-related) structures and dynamics of emerging networks within the blogosphere is the higher-level aim of the research presented here. This paper focuses on this project´s initial phase, in which the above-mentioned data of interest needs to be collected and made available offline for further analyses. Our proprietary development of a tailor-made feed-crawler meets exactly this need. The main concept, the techniques and the implementation details of the crawler thus form the main interest of this paper and furthermore provide the basis for future project phases.
Keywords :
Web sites; content management; RSS feeds; blogosphere mapping; open source intelligence; social media; tailor made feed crawler; Centralized control; Content management; Crawlers; Data mining; Feeds; Intelligent networks; Intelligent robots; Intelligent structures; Internet; Social network services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Information Networking and Applications (AINA), 2010 24th IEEE International Conference on
Conference_Location :
Perth, WA
ISSN :
1550-445X
Print_ISBN :
978-1-4244-6695-5
Type :
conf
DOI :
10.1109/AINA.2010.95
Filename :
5474737
Link To Document :
بازگشت