Title :
Adaptive name matching in information integration
Author :
Bilenko, Mikhail ; Mooney, Raymond ; Cohen, William ; Ravikumar, Pradeep ; Fienberg, Stephen
Author_Institution :
Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
Abstract :
Identifying approximately duplicate database records that refer to the same entity is essential for information integration. The authors compare and describe methods for combining and learning textual similarity measures for name matching.
Keywords :
Internet; distributed databases; learning (artificial intelligence); string matching; text analysis; Internet; Web pages; adaptive name matching; duplicate database records; heterogeneous information sources; information integration; machine learning; string similarity measures; textual similarity measures; Character recognition; Costs; Couplings; Data mining; Databases; Object detection; Optical character recognition software; Optical recording; Uncertainty; Web pages;
Journal_Title :
Intelligent Systems, IEEE
DOI :
10.1109/MIS.2003.1234765