DocumentCode
932362
Title
TreeDT: tree pattern mining for gene mapping
Author
Sevon, P. ; Toivonen, H. ; Ollikainen, V.
Author_Institution
Dept. of Comput. Sci., Helsinki Univ.
Volume
3
Issue
2
fYear
2006
Firstpage
174
Lastpage
185
Abstract
We describe TreeDT, a novel association-based gene mapping method. Given a set of disease-associated haplotypes and a set of control haplotypes, TreeDT predicts likely locations of a disease susceptibility gene. TreeDT extracts, essentially in the form of haplotype trees, information about historical recombinations in the population: A haplotype tree constructed at a given chromosomal location is an estimate of the genealogy of the haplotypes. TreeDT constructs these trees for all locations on the given haplotypes and performs a novel disequilibrium test on each tree: Is there a small set of subtrees with relatively high proportions of disease-associated chromosomes, suggesting shared genetic history for those and a likely disease gene location? We give a detailed description of TreeDT and the tree disequilibrium tests, we analyze the algorithm formally, and we evaluate its performance experimentally on both simulated and real data sets. Experimental results demonstrate that TreeDT has high accuracy on difficult mapping tasks and comparisons to other methods (EATDT, HPM, TDT) show that TreeDT is very competitive
Keywords
cellular biophysics; data mining; diseases; genetics; medical diagnostic computing; molecular biophysics; trees (mathematics); TreeDT; association-based gene mapping; disease gene location; disease susceptibility gene; disease-associated chromosomes; disease-associated haplotypes; genealogy; haplotype tree; historical recombinations; shared genetic history; tree disequilibrium tests; tree pattern mining; Algorithm design and analysis; Biological cells; Chromosome mapping; Data mining; Diseases; Genetics; History; Performance analysis; Performance evaluation; Testing; Biology and genetics; nonnumerical algorithms and problems.; nonparametric statistics; Algorithms; Chromosome Mapping; Computational Biology; Computer Simulation; Diabetes Mellitus, Type 1; Genetic Predisposition to Disease; Haplotypes; Humans; Linkage Disequilibrium; Pedigree; Recombination, Genetic; Statistics, Nonparametric;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2006.28
Filename
1631998
Link To Document