• DocumentCode
    2875302
  • Title
    Discovering Irrelevance in the Blogosphere through Blog Search
  • Author
    Qureshi, M. Atif; Younus, Arjumand; Touheed, Nasir; Qureshi, M. Shahid; Saeed, Muhammad

  • Author_Institution
    Fac. of Comput. Sci., Inst. of Bus. Adm., Karachi, Pakistan
  • fYear
    2011
  • fDate
    25-27 July 2011
  • Firstpage
    457
  • Lastpage
    460
  • Abstract
    Web 2.0 technologies have given birth to the blogosphere, an information-sharing medium by the users, for the users. These technologies have also extended the search problem to a new form of search known as blog search. Like Web search, blog search is affected by spam, which degrades the quality of search results. This paper addresses the problem of blog relevance in the top search results for general topic queries. It studies irrelevant blogs appearing in the top search results of Google Blog Search for blogspot domains. We define metrics for irrelevant blogs by observing the qualitative relevance of their content and by analyzing their link structure. Our preliminary results show an overall recall of 0.875 with a precision of 1.0 for finding irrelevant blogs in the top 15 search results for six general topic queries on Google Blog Search.
  • Keywords
    Internet; Web sites; data mining; query processing; search engines; search problems; unsolicited e-mail; Google blog search; Web 2.0 technologies; Web search; blogosphere irrelevance discovering; general topic queries; information sharing medium; search problem; spam; Blogs; Communities; Google; Measurement; Search engines; Search problems; Social network services; blog search; blogosphere; content-based; irrelevance; link structure; splogs;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advances in Social Networks Analysis and Mining (ASONAM), 2011 International Conference on
  • Conference_Location
    Kaohsiung
  • Print_ISBN
    978-1-61284-758-0
  • Electronic_ISBN
    978-0-7695-4375-8
  • Type
    conf
  • DOI
    10.1109/ASONAM.2011.84
  • Filename
    5992614