Title :
Learning from genome sequences utilizing computational intelligence
Author :
Yang, Jack Y. ; Yang, Mary Qu ; Ersoy, Okan
Author_Institution :
Harvard Med. Sch., Harvard Univ., Boston, MA
Abstract :
Advances in genome sequencing technology have led to an exploration in the amount of sequence data available, learning from proteins coded for by genomes is a difficult task. Bioinformatics is thus a burgeoning field that holds great promise for deepening our understanding of biochemical pathways, for understanding the genetic differences between species and how they arose, and for understanding the genetic basis of various disease processes. We developed a method for classification and knowledge discovery in membrane and intrinsic unstructured/disordered proteins (IUP). We analyzed the amino acid compositions and biophysical properties of proteins. Our joint transmembrane and IUP predictor utilized biophysical characterizations, feature generation, feature selection and computational intelligence as well as ensemble methods to improve the accuracies and performances
Keywords :
biology computing; biomembranes; data mining; feature extraction; genetics; learning (artificial intelligence); molecular biophysics; proteins; amino acid composition; bioinformatics; biophysical characterization; computational intelligence; data mining; feature generation; feature selection; genetic difference; genome sequence; intrinsic disordered protein; intrinsic unstructured protein; knowledge discovery; learning; membrane protein; transmembrane; Amino acids; Bioinformatics; Biomembranes; Character generation; Computational intelligence; Diseases; Genetics; Genomics; Protein engineering; Sequences; Bioinformatics; classification and knowledge discovery; data mining and computational intelligence; intrinsic unstructured proteins; membrane proteins;
Conference_Titel :
Computational Intelligence Methods and Applications, 2005 ICSC Congress on
Conference_Location :
Istanbul
Print_ISBN :
1-4244-0020-1
DOI :
10.1109/CIMA.2005.1662343