Title :
Integration and collection of heterogeneous data based on metedata
Author_Institution :
Inst. of Sci. & Tech. Inf. of China, Beijing, China
Abstract :
In order to solve the problem of heterogeneous data collection and integration in different industries, metadata-based retrieval model and rule-based web wrapper are proposed. Retrieval metadata set and industry retrieval model are constructed by applying metadata extraction technology, and the rule tree of industry data attributes is built and web wrapper is designed based on industry data format analysis. On the basis of above work, industry data collection system is realized to complete industry data collection. Through the study, researchers can obtain research data rapidly and accurately.
Keywords :
Internet; data analysis; distributed databases; information retrieval; meta data; heterogeneous data collection; heterogeneous data integration; industry data attributes; industry data collection system; industry data format analysis; industry retrieval model; metadata extraction technology; metadata-based retrieval model; metedata; retrieval metadata set; rule tree; rule-based Web wrapper; Data collection; Data mining; Data models; Industries; Patents; Search engines; Web pages; retrieval metadata; retrieval model; rule tree; web wrapper;
Conference_Titel :
Information Management, Innovation Management and Industrial Engineering (ICIII), 2013 6th International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-3985-5
DOI :
10.1109/ICIII.2013.6702910