Title :
Evaluation of Breast Cancer Susceptibility Using Improved Genetic Algorithms to Generate Genotype SNP Barcodes
Author :
Cheng-Hong Yang ; Yu-Da Lin ; Li-Yeh Chuang ; Hsueh-Wei Chang
Author_Institution :
Dept. of Electron. Eng., Nat. Kaohsiung Univ. of Appl. Sci., Kaohsiung, Taiwan
Abstract :
Genetic association is a challenging task for the identification and characterization of genes that increase the susceptibility to common complex multifactorial diseases. To fully execute genetic studies of complex diseases, modern geneticists face the challenge of detecting interactions between loci. A genetic algorithm (GA) is developed to detect the association of genotype frequencies of cancer cases and noncancer cases based on statistical analysis. An improved genetic algorithm (IGA) is proposed to improve the reliability of the GA method for high-dimensional SNP-SNP interactions. The strategy offers the top five results to the random population process, in which they guide the GA toward a significant search course. The IGA increases the likelihood of quickly detecting the maximum ratio difference between cancer cases and noncancer cases. The study systematically evaluates the joint effect of 23 SNP combinations of six steroid hormone metabolisms, and signaling-related genes involved in breast carcinogenesis pathways were systematically evaluated, with IGA successfully detecting significant ratio differences between breast cancer cases and noncancer cases. The possible breast cancer risks were subsequently analyzed by odds-ratio (OR) and risk-ratio analysis. The estimated OR of the best SNP barcode is significantly higher than 1 (between 1.15 and 7.01) for specific combinations of two to 13 SNPs. Analysis results support that the IGA provides higher ratio difference values than the GA between breast cancer cases and noncancer cases over 3-SNP to 13-SNP interactions. A more specific SNP-SNP interaction profile for the risk of breast cancer is also provided.
Keywords :
bioinformatics; cancer; genetic algorithms; genetics; genomics; random processes; statistical analysis; GA method; IGA; SNP combination; breast cancer risks; breast cancer susceptibility; breast carcinogenesis pathways; common complex multifactorial diseases; gene characterization; gene identification; genetic association; genotype SNP barcode; genotype frequency; high-dimensional SNP-SNP interaction; improved genetic algorithm; loci interaction detection; maximum ratio difference; noncancer case; odds-ratio; random population process; risk-ratio analysis; search course; signaling-related genes; significant ratio differences; single nucleotide polymorphism; statistical analysis; steroid hormone metabolism; Breast cancer; Cancer; Classification; Genetic algorithms; Genetics; SNP-SNP interactions; Single nucleotide polymorphism; breast cancer; genetic algorithm; Algorithms; Automatic Data Processing; Breast Neoplasms; Computational Biology; Databases, Genetic; Female; Genetic Predisposition to Disease; Genotype; Humans; Models, Genetic; Models, Statistical; Polymorphism, Single Nucleotide; Risk; Signal Transduction; Tumor Markers, Biological;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2013.27