DocumentCode :
3259859
Title :
A Comparison of Personal Name Matching: Techniques and Practical Issues
Author :
Christen, Peter
Author_Institution :
Dept. of Comput. Sci., Australian Nat. Univ., Canberra, ACT
fYear :
2006
fDate :
Dec. 2006
Firstpage :
290
Lastpage :
294
Abstract :
Finding and matching personal names is at the core of an increasing number of applications: from text and Web mining, search engines, to information extraction, deduplication and data linkage systems. Variations and errors in names make exact string matching problematic, and approximate matching techniques have to be applied. When compared to general text, however, personal names have different characteristics that need to be considered. In this paper, we discuss the characteristics of personal names and present potential sources of variations and errors. We then overview a comprehensive number of commonly used, as well as some recently developed name matching techniques. Experimental comparisons using four large name data sets indicate that there is no clear best matching technique.
Keywords :
string matching; text analysis; Web mining; data linkage system; deduplication; information extraction; personal name matching; search engine; text mining; Computer errors; Computer science; Conferences; Data mining; Information retrieval; Model driven engineering; Radio frequency; Terrorism; Tin
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
Type :
conf
DOI :
10.1109/ICDMW.2006.2
Filename :
4063641