• DocumentCode
    568231
  • Title
    A mobile phone information search engine based on Heritrix and Lucene
  • Author
    Chen, Jianxia ; Wu, Wei ; Wang, Chunzhi
  • Author_Institution
    Sch. of Comput. Sci., Hubei Univ. of Technol., Wuhan, China
  • fYear
    2012
  • fDate
    14-17 July 2012
  • Firstpage
    1602
  • Lastpage
    1604
  • Abstract
    With the rapid spread of web applications, search engines based on personalized services are becoming more important in society. This paper proposes an approach to designing and implementing a personalized search engine based on Heritrix and Lucene. In particular, it designs a web crawler for mobile phone information that uses a URL hashing algorithm to achieve efficient multithreaded crawling. Experimental results show that the approach performs efficiently.
  • Keywords
    Internet; file organisation; information retrieval; mobile handsets; multi-threading; search engines; Heritrix; Lucene; URL hashing algorithm; Web crawler; mobile phone information search engine; multithread; personalized services; Algorithm design and analysis; Computer science; Crawlers; Education; Indexes; Optimization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Science & Education (ICCSE), 2012 7th International Conference on
  • Conference_Location
    Melbourne, VIC
  • Print_ISBN
    978-1-4673-0241-8
  • Type
    conf
  • DOI
    10.1109/ICCSE.2012.6295370
  • Filename
    6295370