DocumentCode
2875302
Title
Discovering Irrelevance in the Blogosphere through Blog Search
Author
Qureshi, M. Atif ; Younus, Arjumand ; Touheed, Nasir ; Qureshi, M. Shahid ; Saeed, Muhammad
Author_Institution
Fac. of Comput. Sci., Inst. of Bus. Adm., Karachi, Pakistan
fYear
2011
fDate
25-27 July 2011
Firstpage
457
Lastpage
460
Abstract
Web 2.0 technologies have given birth to the blogosphere, which is an information sharing medium by the users for the users. Furthermore, these technologies have also expanded the search problem to a new form of search known as blog search. Similar to Web search, blog search has been affected by spam which affects the quality of search results. This paper approaches the relevant blog problem in the top search results against the general topic queries. It pursues a study of irrelevant blogs appearing in the top search results of Google Blog Search for the blog spot domains. We define metrics for irrelevant blogs by observing the qualitative relevance of content and by analyzing the link structure of those blogs. Our preliminary results show an overall recall of 0.875 with a precision of 1.0 for finding irrelevant blogs in the top 15 search results against six general topic queries on Google Blog Search.
Keywords
Internet; Web sites; data mining; query processing; search engines; search problems; unsolicited e-mail; Google blog search; Web 2.0 technologies; Web search; blogosphere irrelevance discovering; general topic queries; information sharing medium; search problem; spam; Blogs; Communities; Google; Measurement; Search engines; Search problems; Social network services; blog search; blogosphere; content-based; irrelevance; link structure; splogs;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Social Networks Analysis and Mining (ASONAM), 2011 International Conference on
Conference_Location
Kaohsiung
Print_ISBN
978-1-61284-758-0
Electronic_ISBN
978-0-7695-4375-8
Type
conf
DOI
10.1109/ASONAM.2011.84
Filename
5992614
Link To Document