Title :
Metric-Based Data Mining Model for Genealogical Record Linkage
Author :
Ivie, Stephen ; Pixton, Burdette ; Giraud-Carrier, Christophe
Author_Institution :
Brigham Young Univ., Provo
Abstract :
Genealogical Record Linkage (GRL) is the process of determining whether two pedigrees refer to the same base individual. Unlike other record linkage problems, GRL datasets are extremely sparse and have several multi-valued attributes. In this paper, we describe a metric-based, data mining approach to GRL, and report on its successful application to a large post-blocking, standardized database.
Keywords :
biology computing; data mining; genetics; genealogical record linkage; metric-based data mining; multivalued attribute; Cancer; Computer science; Couplings; Data mining; Databases; Decision trees; Diseases; Genetics; Performance evaluation; Testing;
Conference_Titel :
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Conference_Location :
Las Vegas, IL
Print_ISBN :
1-4244-1500-4
Electronic_ISBN :
1-4244-1500-4
DOI :
10.1109/IRI.2007.4296676