Title :
Automated Metadata Extraction from Web Sources
Author :
Yahaya, Nor Adnan ; Buang, Rosiza
Author_Institution :
Malaysia Univ. of Sci. & Technol.
Abstract :
This paper discusses the application of Web wrapping technology to extracting metadata from Web sources. This capability has been incorporated into a software tool known as the Dynamic Dublin Core/Resource Description Framework Metadata Editor (DDC/RDF-Editor), which supports metadata development and management for resources on the World Wide Web. One key feature of the editor is its ability to automatically extract relevant values of metadata elements from the Web sources in question according to the Dublin Core (DC) metadata standard and to represent them in the Resource Description Framework (RDF) language.
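The extraction step described in the abstract can be illustrated with a minimal sketch. It is not the DDC/RDF-Editor's actual implementation, which the abstract does not detail. It assumes a simple wrapper that reads the HTML title and a few common meta tags, maps them to Dublin Core elements through a hypothetical table (META_TO_DC), and serializes the result as RDF/XML. The names MetaWrapper, extract_dc and to_rdf_xml are illustrative only.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Standard namespace URIs for Dublin Core elements and RDF syntax.
DC_NS = "http://purl.org/dc/elements/1.1/"
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Hypothetical mapping from HTML <meta name="..."> values to DC elements
# (an assumption for this sketch, not taken from the paper).
META_TO_DC = {
    "author": "creator",
    "description": "description",
    "keywords": "subject",
}


class MetaWrapper(HTMLParser):
    """Collects candidate Dublin Core values from an HTML page."""

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            name = (attrs.get("name") or "").lower()
            content = attrs.get("content")
            if name in META_TO_DC and content:
                # Keep the first value seen for each DC element.
                self.fields.setdefault(META_TO_DC[name], content.strip())

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title and data.strip():
            self.fields.setdefault("title", data.strip())


def extract_dc(html):
    """Return a dict of DC element name -> value extracted from html."""
    parser = MetaWrapper()
    parser.feed(html)
    parser.close()
    return parser.fields


def to_rdf_xml(url, fields):
    """Serialize the extracted fields as an RDF/XML description of url."""
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("dc", DC_NS)
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    desc = ET.SubElement(
        root, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": url}
    )
    for element, value in sorted(fields.items()):
        ET.SubElement(desc, f"{{{DC_NS}}}{element}").text = value
    return ET.tostring(root, encoding="unicode")
```

Calling extract_dc on a page's HTML and passing the result to to_rdf_xml together with the page URL yields one rdf:Description whose child elements are the dc: properties found. A real wrapper would likely need site-specific rules for elements such as date or publisher that rarely appear in meta tags.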
Keywords :
XML; meta data; semantic Web; Web sources; Web wrapping technology; World Wide Web; automated metadata extraction; dynamic Dublin core; resource description framework language; resource description framework metadata editor; DC generators; Data mining; Encoding; HTML; Markup languages; Resource description framework; Standards development; Vocabulary; Wrapping
Conference_Title :
Web Intelligence and Intelligent Agent Technology Workshops, 2006. WI-IAT 2006 Workshops. 2006 IEEE/WIC/ACM International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2749-3
DOI :
10.1109/WI-IATW.2006.49