DocumentCode :
2215490
Title :
An evolutionary-based approach for feature generation: Eukaryotic promoter recognition
Author :
Kamath, Uday ; De Jong, Kenneth A. ; Shehu, Amarda
Author_Institution :
Dept. Comput. Sci., George Mason Univ., Fairfax, VA, USA
fYear :
2011
fDate :
5-8 June 2011
Firstpage :
277
Lastpage :
284
Abstract :
Prediction of promoter regions continues to be a challenging subproblem in mapping out eukaryotic DNA. While this task is key to understanding the regulation of differential transcription, the gene-specific architecture of promoter sequences does not readily lend itself to general strategies. To date, the best approaches are based on Support Vector Machines (SVMs) that employ standard "spectrum" features and achieve promoter region classification accuracies from a low of 84% to a high of 94% depending on the particular species involved. In this paper, we propose a general and powerful methodology that uses Genetic Programming (GP) techniques to generate more complex and more gene-specific features to be used with a standard SVM for promoter region identification. We evaluate our methodology on three data sets from different species and observe consistent classification accuracies in the 94 95% range. In addition, because the GP-generated features are gene-specific, they can be used by biologists to advance their understanding of the architecture of eukaryotic promoter regions.
Keywords :
DNA; biology computing; genetic algorithms; genetics; pattern classification; support vector machines; SVM; eukaryotic DNA; eukaryotic promoter recognition; evolutionary-based approach; feature generation; genetic programming techniques; promoter region classification; promoter region identification; promoter region prediction; support vector machines; Accuracy; Artificial neural networks; Bioinformatics; DNA; Genomics; Support vector machines; Training; Evolutionary Algorithms; Promoter Prediction; Support Vector Machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2011 IEEE Congress on
Conference_Location :
New Orleans, LA
ISSN :
Pending
Print_ISBN :
978-1-4244-7834-7
Type :
conf
DOI :
10.1109/CEC.2011.5949629
Filename :
5949629
Link To Document :
بازگشت