• DocumentCode
    2691807
  • Title

    Focused Crawling for Retrieving E-commerce Information Based on Learnable Ontology and Link Prediction

  • Author

    Huang, Wei ; Zhang, Liyi ; Zhang, Jidong ; Zhu, Mingzhu

  • Author_Institution
    Sch. of Inf. Manage., Wuhan Univ., Wuhan, China
  • fYear
    2009
  • fDate
    16-17 May 2009
  • Firstpage
    574
  • Lastpage
    579
  • Abstract
    With the rapid growth of the e-commerce, how to discovery the specific information such as about buyer, seller and products etc. adapting for the online business user becomes a focused issue to the information search engine. Focused crawling is proposed to selectively seek out pages that are relevant to a predefined set of topics without downloading all of the Web. We present a novel approach for building an intelligent focused crawler which deals with evaluating the pagepsilas content relevance to the e-commerce topic by the domain ontology and the hyperlinks connection to the commercial Web pages by link analysis. In the process of crawling, the domain ontology can evolve automatically by machine learning based on the statistics and rules. Experiments have been performed, and the results show that our approach is more effective than the other traditional crawling algorithms, and prevents the topic-drift with higher harvest rate.
  • Keywords
    electronic commerce; information retrieval; learning (artificial intelligence); ontologies (artificial intelligence); search engines; commercial Web pages; domain ontology; e-commerce; electronic commerce; hyperlinks connection; information retrieval; information search engine; intelligent focused crawler; learnable ontology; link prediction; machine learning; online business user; Business; Crawlers; Information retrieval; Intelligent structures; Learning systems; Machine learning; Ontologies; Search engines; Statistics; Web pages; E-commerce; Focused crawling; Information retrieval; Machine learning; Ontology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Engineering and Electronic Commerce, 2009. IEEC '09. International Symposium on
  • Conference_Location
    Ternopil
  • Print_ISBN
    978-0-7695-3686-6
  • Type

    conf

  • DOI
    10.1109/IEEC.2009.127
  • Filename
    5175184