• DocumentCode
    3479923
  • Title

    An ontology-supported web focused-crawler for Java programs

  • Author

    Yang, Sheng-yuan ; Hsu, Chun-liang

  • Author_Institution
    Dept. of Comput. & Commun. Eng., St. John´´s Univ., Taipei, Taiwan
  • fYear
    2010
  • fDate
    5-6 July 2010
  • Firstpage
    266
  • Lastpage
    271
  • Abstract
    This paper proposed an ontology-support web focused-crawler: OntoCrawler III for Java programs, in which only the user entered some keywords would the system supported by the domain ontology actively provide comparison and verification for those keywords so as to up-rise the precision and recall rates of webpage searching. This technique has practically been installed in Google and Yahoo search engines and furthermore searched and filtered out unduplicated and related Java open source webpages and accordingly downloaded and stored the results into a database to let the backend systems to do advanced processes. The preliminary experiment outcomes proved the OntoCrawler III based on ontology-supported techniques proposed in this paper could not only really up-rise the precision and recall rates of webpage searching but also should successfully download related webpage information.
  • Keywords
    Internet; Java; information retrieval; search engines; Google search engine; Java open source Webpages; Java program; OntoCrawler III; Webpage searching; Yahoo search engines; ontology supported Web focused crawler; Crawlers; Databases; Electronic mail; Explosions; Information filtering; Information filters; Internet; Java; Ontologies; Search engines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Ubi-media Computing (U-Media), 2010 3rd IEEE International Conference on
  • Conference_Location
    Jinhua
  • Print_ISBN
    978-1-4244-6708-2
  • Type

    conf

  • DOI
    10.1109/UMEDIA.2010.5544448
  • Filename
    5544448