DocumentCode
2624179
Title
A Hybrid Object Matching Method for Deep Web Information Integration
Author
Zhao, Pengpeng ; Lin, Chao ; Fang, Wei ; Cui, Zhiming
Author_Institution
Soochow Univ., Suzhou
fYear
2007
fDate
21-23 Nov. 2007
Firstpage
195
Lastpage
198
Abstract
Object matching is a crucial step to integration of Deep Web sources. Existing methods suppose that record extraction and attribute segmentation are of high accuracy. But because of limitation of extraction techniques, information gained through the above methods is often incomplete. If we match object base on noisy and incomplete information, we can not achieve satisfactory performance. This paper proposes a hybrid object matching method, which considers structured and unstructured features and multi-level errors in extraction. We compare performance of the unstructured, structured and hybrid object matching models in our prototype system, which indicates that hybrid method has the highest performance.
Keywords
Internet; database management systems; feature extraction; pattern matching; Deep Web; hybrid object matching; information integration; multilevel errors; unstructured feature extraction; Chaos; Data mining; Databases; HTML; Information processing; Information technology; Internet; Prototypes; Uniform resource locators; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Convergence Information Technology, 2007. International Conference on
Conference_Location
Gyeongju
Print_ISBN
0-7695-3038-9
Type
conf
DOI
10.1109/ICCIT.2007.185
Filename
4420259
Link To Document