• DocumentCode
    2624179
  • Title

    A Hybrid Object Matching Method for Deep Web Information Integration

  • Author

    Zhao, Pengpeng ; Lin, Chao ; Fang, Wei ; Cui, Zhiming

  • Author_Institution
    Soochow Univ., Suzhou
  • fYear
    2007
  • fDate
    21-23 Nov. 2007
  • Firstpage
    195
  • Lastpage
    198
  • Abstract
    Object matching is a crucial step to integration of Deep Web sources. Existing methods suppose that record extraction and attribute segmentation are of high accuracy. But because of limitation of extraction techniques, information gained through the above methods is often incomplete. If we match object base on noisy and incomplete information, we can not achieve satisfactory performance. This paper proposes a hybrid object matching method, which considers structured and unstructured features and multi-level errors in extraction. We compare performance of the unstructured, structured and hybrid object matching models in our prototype system, which indicates that hybrid method has the highest performance.
  • Keywords
    Internet; database management systems; feature extraction; pattern matching; Deep Web; hybrid object matching; information integration; multilevel errors; unstructured feature extraction; Chaos; Data mining; Databases; HTML; Information processing; Information technology; Internet; Prototypes; Uniform resource locators; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Convergence Information Technology, 2007. International Conference on
  • Conference_Location
    Gyeongju
  • Print_ISBN
    0-7695-3038-9
  • Type

    conf

  • DOI
    10.1109/ICCIT.2007.185
  • Filename
    4420259