Title :
State of the art in metadata abstraction crawlers
Author :
Dong, Hai ; Hussain, Farookh Khadeer ; Chang, Elizabeth
Author_Institution :
Digital Ecosyst. & Bus. Intell. Inst., Curtin Univ. of Technol., Perth, WA
Abstract :
Nowadays, the research of crawlers moves closer to the semantic web, along with the appearance of increasing XML/RDF/OWL files and the rapid development of ontology mark-up languages. As an emerging concept, metadata abstraction crawlers are a series of crawlers that aim to abstract metadata from normal HTML documents, based on various semantic Web technologies. In this paper, we make a general survey of the current situation of metadata abstraction crawlers. Fourteen cases in this field are chosen as typical examples, and classified in five clusters. From seven perspectives we horizontally compare and contrast the semantic Web crawlers in each cluster, and draw our conclusion in the final section.
Keywords :
data structures; meta data; semantic Web; OWL files; RDF; XML; metadata abstraction crawlers; ontology mark-up languages; semantic Web technologies; Australia; Crawlers; Ecosystems; HTML; OWL; Ontologies; Organizing; Resource description framework; Semantic Web; XML; OAI-PMH; RDF crawlers; focused crawlers; metadata abstraction; semantic web crawlers;
Conference_Titel :
Industrial Technology, 2008. ICIT 2008. IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-1705-6
Electronic_ISBN :
978-1-4244-1706-3
DOI :
10.1109/ICIT.2008.4608573