Title :
A Hybrid Object Matching Method for Deep Web Information Integration
Author :
Zhao, Pengpeng ; Lin, Chao ; Fang, Wei ; Cui, Zhiming
Author_Institution :
Soochow Univ., Suzhou
Abstract :
Object matching is a crucial step to integration of Deep Web sources. Existing methods suppose that record extraction and attribute segmentation are of high accuracy. But because of limitation of extraction techniques, information gained through the above methods is often incomplete. If we match object base on noisy and incomplete information, we can not achieve satisfactory performance. This paper proposes a hybrid object matching method, which considers structured and unstructured features and multi-level errors in extraction. We compare performance of the unstructured, structured and hybrid object matching models in our prototype system, which indicates that hybrid method has the highest performance.
Keywords :
Internet; database management systems; feature extraction; pattern matching; Deep Web; hybrid object matching; information integration; multilevel errors; unstructured feature extraction; Chaos; Data mining; Databases; HTML; Information processing; Information technology; Internet; Prototypes; Uniform resource locators; Web pages;
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
DOI :
10.1109/ICCIT.2007.185