Title :
Odaies: ontology-driven adaptive Web information extraction system
Author :
Zhang, Ning ; Chen, Hong ; Wang, Yu ; Cheng, Shi-Jun ; Xiong, Ming-Feng
Author_Institution :
Dept. of Comput. Sci. & Technol., Peking Univ., Beijing, China
Abstract :
This paper proposes an ontology-driven self-adapting approach in the semi-structured Web information extraction field, where ontology provides semantic support and plays a central role during the extraction process. It outperforms traditional wrapper systems in adaptiveness and maintenance. Firstly, we build a domain-dependant ontology. Then we design three templates generating algorithms, which have self-adaptiveness and self-maintenance based on the ontology, to perform Web page information extraction. Experiment results show that our prototype system can achieve 100% recall and 97.64% precision.
Keywords :
Web sites; adaptive systems; data mining; information retrieval; Odaies; Web page; World Wide Web; domain-dependent ontology; knowledge discovery; ontology driven adaptive Web information extraction system; semantic support; wrapper system; Algorithm design and analysis; Computer science; Data mining; HTML; Hazards; Heuristic algorithms; Ontologies; Prototypes; Web pages; Web sites;
Conference_Titel :
Intelligent Agent Technology, 2003. IAT 2003. IEEE/WIC International Conference on
Print_ISBN :
0-7695-1931-8
DOI :
10.1109/IAT.2003.1241120