DocumentCode
3479923
Title
An ontology-supported web focused-crawler for Java programs
Author
Yang, Sheng-yuan ; Hsu, Chun-liang
Author_Institution
Dept. of Comput. & Commun. Eng., St. John's Univ., Taipei, Taiwan
fYear
2010
fDate
5-6 July 2010
Firstpage
266
Lastpage
271
Abstract
This paper proposes OntoCrawler III, an ontology-supported focused web crawler for Java programs. The user only needs to enter a few keywords; the system, supported by a domain ontology, then actively compares and verifies those keywords in order to raise the precision and recall of webpage searching. The technique has been applied in practice to the Google and Yahoo search engines, where it retrieves and filters Java open-source webpages, keeping those that are relevant and not duplicated, and then downloads the results and stores them in a database for advanced processing by backend systems. Preliminary experimental results indicate that OntoCrawler III, based on the ontology-supported techniques proposed in this paper, not only raises the precision and recall of webpage searching but also successfully downloads related webpage information.
Keywords
Internet; Java; information retrieval; search engines; Google search engine; Java open source Webpages; Java program; OntoCrawler III; Webpage searching; Yahoo search engines; ontology supported Web focused crawler; Crawlers; Databases; Electronic mail; Explosions; Information filtering; Information filters; Ontologies
fLanguage
English
Publisher
IEEE
Conference_Titel
Ubi-media Computing (U-Media), 2010 3rd IEEE International Conference on
Conference_Location
Jinhua
Print_ISBN
978-1-4244-6708-2
Type
conf
DOI
10.1109/UMEDIA.2010.5544448
Filename
5544448