Title :
Exploring Alternative Splicing Features Using Support Vector Machines
Author :
Xia, Jing ; Caragea, Doina ; Brown, Susan
Author_Institution :
Kansas State Univ., Manhattan, KS
Abstract :
Alternative splicing is a mechanism for generating different gene transcripts (called isoforms) from the same genomic sequence. Finding alternative splicing events experimentally is both expensive and time consuming. Computational methods, in general, and machine learning algorithms,in particular, can be used to complement experimental methods in the process of identifying alternative splicing events. In this paper, we explore the predictive power of a rich set of features that have been experimentally shown to affect alternative splicing. We use these features to build support vector machine (SVM) classifiers for distinguishing between alternatively spliced exons and constitutive exons.Our results show that simple linear SVM classifiers built from a rich set of features give results comparable to those of more sophisticated SVM classifiers that use more basic sequence features. Furthermore, we use feature selection methods to identify computationally the most informative features for the prediction problem considered.
Keywords :
feature extraction; genetics; learning (artificial intelligence); medical computing; splicing; support vector machines; SVM classifiers; alternative splicing features; feature selection methods; gene transcripts; genomic sequence; isoforms; machine learning algorithms; prediction problem; support vector machines; Bioinformatics; DNA; Genomics; Machine learning algorithms; Proteins; Sequences; Splicing; Support vector machine classification; Support vector machines; USA Councils; alternative splicing; feature construction; support vector machine;
Conference_Titel :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
DOI :
10.1109/BIBM.2008.12