Title :
Mapping the Blogosphere--Towards a Universal and Scalable Blog-Crawler
Author :
Berger, Philipp ; Hennig, Patrick ; Bross, Justus ; Meinel, Christoph
Author_Institution :
IT-Syst. Eng., Hasso-Plattner Inst., Potsdam, Germany
Abstract :
The massive adoption of social media has provided new ways for individuals to express their opinions online. The blogosphere, an inherent part of this trend, contains a vast array of information about a variety of topics. Thus, it is a huge think tank that creates an enormous and ever-changing archive of open source intelligence. The higher-level aim of the research presented here is to model and mine this vast pool of data, extracting and describing meaningful knowledge in order to leverage the (content-related) structures and dynamics of emerging networks within the blogosphere. While the concept of our tailor-made feed-crawler was already discussed in two earlier publications, this paper focuses on our approach to extending that feed-crawler into a more universal and highly scalable blog-crawler.
Keywords :
social networking (online); blog-crawler; blogosphere; open source intelligence; social media; tailor-made feed-crawler; Blogs; Crawlers; Data mining; Databases; Feeds; HTML; Hardware; Blog; Blogosphere; Data Mining; Hasso Plattner; In-Memory; MapReduce; Ranking; Social Media Monitoring;
Conference_Titel :
Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third International Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4577-1931-8
DOI :
10.1109/PASSAT/SocialCom.2011.57