• DocumentCode
    668379
  • Title

    Integration and collection of heterogeneous data based on metadata

  • Author

    Liwei Zhang

  • Author_Institution
    Inst. of Sci. & Tech. Inf. of China, Beijing, China
  • Volume
    1
  • fYear
    2013
  • fDate
    23-24 Nov. 2013
  • Firstpage
    205
  • Lastpage
    208
  • Abstract
    To address the problem of collecting and integrating heterogeneous data across different industries, a metadata-based retrieval model and a rule-based web wrapper are proposed. A retrieval metadata set and an industry retrieval model are constructed by applying metadata extraction technology. A rule tree of industry data attributes is built, and a web wrapper is designed, based on an analysis of industry data formats. On the basis of this work, an industry data collection system is implemented to carry out industry data collection. As a result, researchers can obtain research data rapidly and accurately.
  • Keywords
    Internet; data analysis; distributed databases; information retrieval; meta data; heterogeneous data collection; heterogeneous data integration; industry data attributes; industry data collection system; industry data format analysis; industry retrieval model; metadata extraction technology; metadata-based retrieval model; metedata; retrieval metadata set; rule tree; rule-based Web wrapper; Data collection; Data mining; Data models; Industries; Patents; Search engines; Web pages; retrieval metadata; retrieval model; rule tree; web wrapper;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Management, Innovation Management and Industrial Engineering (ICIII), 2013 6th International Conference on
  • Conference_Location
    Xi'an
  • Print_ISBN
    978-1-4799-3985-5
  • Type

    conf

  • DOI
    10.1109/ICIII.2013.6702910
  • Filename
    6702910