• DocumentCode
    932362
  • Title

    TreeDT: tree pattern mining for gene mapping

  • Author

    Sevon, P. ; Toivonen, H. ; Ollikainen, V.

  • Author_Institution
    Dept. of Comput. Sci., Helsinki Univ.
  • Volume
    3
  • Issue
    2
  • fYear
    2006
  • Firstpage
    174
  • Lastpage
    185
  • Abstract
    We describe TreeDT, a novel association-based gene mapping method. Given a set of disease-associated haplotypes and a set of control haplotypes, TreeDT predicts likely locations of a disease susceptibility gene. TreeDT extracts, essentially in the form of haplotype trees, information about historical recombinations in the population: A haplotype tree constructed at a given chromosomal location is an estimate of the genealogy of the haplotypes. TreeDT constructs these trees for all locations on the given haplotypes and performs a novel disequilibrium test on each tree: Is there a small set of subtrees with relatively high proportions of disease-associated chromosomes, suggesting shared genetic history for those and a likely disease gene location? We give a detailed description of TreeDT and the tree disequilibrium tests, we analyze the algorithm formally, and we evaluate its performance experimentally on both simulated and real data sets. Experimental results demonstrate that TreeDT has high accuracy on difficult mapping tasks and comparisons to other methods (EATDT, HPM, TDT) show that TreeDT is very competitive
  • Keywords
    cellular biophysics; data mining; diseases; genetics; medical diagnostic computing; molecular biophysics; trees (mathematics); TreeDT; association-based gene mapping; disease gene location; disease susceptibility gene; disease-associated chromosomes; disease-associated haplotypes; genealogy; haplotype tree; historical recombinations; shared genetic history; tree disequilibrium tests; tree pattern mining; Algorithm design and analysis; Biological cells; Chromosome mapping; Data mining; Diseases; Genetics; History; Performance analysis; Performance evaluation; Testing; Biology and genetics; nonnumerical algorithms and problems.; nonparametric statistics; Algorithms; Chromosome Mapping; Computational Biology; Computer Simulation; Diabetes Mellitus, Type 1; Genetic Predisposition to Disease; Haplotypes; Humans; Linkage Disequilibrium; Pedigree; Recombination, Genetic; Statistics, Nonparametric;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2006.28
  • Filename
    1631998