Title :
Classification techniques for recurrent DNA copy number data
Author :
Alqallaf, Abdullah K. ; Tewfik, Ahmed H.
Author_Institution :
Dept. of Electr. & Comput. Eng., Minnesota Univ., Minneapolis, MN
Abstract :
Genetic instabilities in the human genome are frequently exhibited in the form of DNA copy number variations. In this paper, we propose a framework to evaluate the predictive power of recurrent copy number variations at multiple genomic sites for detecting genetic diseases. Moreover, we compare the ability of two well known clustering algorithms, k-means and Fuzzy c-means, to correctly classify diseased samples with respect to the control samples. Finally, we apply our proposed techniques on 51 samples of 25 apparently healthy and 26 autistic children. Our results show that using CNVs at multiple sites will considerably increase classification performance when compared with the traditional classifiers that focus on a single CNV.
Keywords :
DNA; biology computing; fuzzy set theory; genetics; pattern classification; Fuzzy c-means clustering; genetic disease detection; human genome; k-means clustering; recurrent DNA copy number data; Autism; Bioinformatics; Clustering algorithms; DNA; Diseases; Fuzzy control; Genetics; Genomics; Humans; Pediatrics; Classification methods; DNA copy numberanalysis; Selective-smoothing technique; Statistical-based model;
Conference_Titel :
Communications, Control and Signal Processing, 2008. ISCCSP 2008. 3rd International Symposium on
Conference_Location :
St Julians
Print_ISBN :
978-1-4244-1687-5
Electronic_ISBN :
978-1-4244-1688-2
DOI :
10.1109/ISCCSP.2008.4537411