Title :
Discovering Irrelevance in the Blogosphere through Blog Search
Author :
Qureshi, M. Atif ; Younus, Arjumand ; Touheed, Nasir ; Qureshi, M. Shahid ; Saeed, Muhammad
Author_Institution :
Fac. of Comput. Sci., Inst. of Bus. Adm., Karachi, Pakistan
Abstract :
Web 2.0 technologies have given birth to the blogosphere, which is an information sharing medium by the users for the users. Furthermore, these technologies have also expanded the search problem to a new form of search known as blog search. Similar to Web search, blog search has been affected by spam which affects the quality of search results. This paper approaches the relevant blog problem in the top search results against the general topic queries. It pursues a study of irrelevant blogs appearing in the top search results of Google Blog Search for the blog spot domains. We define metrics for irrelevant blogs by observing the qualitative relevance of content and by analyzing the link structure of those blogs. Our preliminary results show an overall recall of 0.875 with a precision of 1.0 for finding irrelevant blogs in the top 15 search results against six general topic queries on Google Blog Search.
Keywords :
Internet; Web sites; data mining; query processing; search engines; search problems; unsolicited e-mail; Google blog search; Web 2.0 technologies; Web search; blogosphere irrelevance discovering; general topic queries; information sharing medium; search problem; spam; Blogs; Communities; Google; Measurement; Search engines; Search problems; Social network services; blog search; blogosphere; content-based; irrelevance; link structure; splogs;
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2011 International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-61284-758-0
Electronic_ISBN :
978-0-7695-4375-8
DOI :
10.1109/ASONAM.2011.84