DocumentCode :
2624179
Title :
A Hybrid Object Matching Method for Deep Web Information Integration
Author :
Zhao, Pengpeng ; Lin, Chao ; Fang, Wei ; Cui, Zhiming
Author_Institution :
Soochow Univ., Suzhou
fYear :
2007
fDate :
21-23 Nov. 2007
Firstpage :
195
Lastpage :
198
Abstract :
Object matching is a crucial step to integration of Deep Web sources. Existing methods suppose that record extraction and attribute segmentation are of high accuracy. But because of limitation of extraction techniques, information gained through the above methods is often incomplete. If we match object base on noisy and incomplete information, we can not achieve satisfactory performance. This paper proposes a hybrid object matching method, which considers structured and unstructured features and multi-level errors in extraction. We compare performance of the unstructured, structured and hybrid object matching models in our prototype system, which indicates that hybrid method has the highest performance.
Keywords :
Internet; database management systems; feature extraction; pattern matching; Deep Web; hybrid object matching; information integration; multilevel errors; unstructured feature extraction; Chaos; Data mining; Databases; HTML; Information processing; Information technology; Internet; Prototypes; Uniform resource locators; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
Type :
conf
DOI :
10.1109/ICCIT.2007.185
Filename :
4420259
Link To Document :
بازگشت