Title :
Combinatorial method of splice sites prediction
Author :
Churbanov, Alexander ; Ali, Hesham
Author_Institution :
Dept. of Comput. Sci., Nebraska Univ., Omaha, NE, USA
Abstract :
Predicting and proper ranking of splice sites (SS) is a challenging problem in bioinformatics and machine learning communities. Proposed method of donor and acceptor SSs prediction is based on counting oligonucleotide frequencies for splice and splice-like signals. Based on bayesian principle SS sensors were built. We demonstrate advantage of our proposed sensor design compared with existing sensors and tools. In particular, our donor sensor outperforms Maximum Entropy Sensor for several representative test sets of genes when compared on Receiver Operating Characteristic (ROC) curve. We represent combinatorial interaction of SSs and related factors with Logarithm Ofo Dds (LOD) weight matrices. Based on factor interactions we were able to substantially improve splice signals prediction quality and rank SSs better than SpliceView, GeneSplicer. NNSplice and Genio tools. Proposed method is used in our new splicing simulator SpliceScan.
Keywords :
DNA; belief networks; biology computing; genetics; learning (artificial intelligence); maximum entropy methods; molecular biophysics; sensitivity analysis; acceptor splice site; bayesian principle; bioinformatics; combinatorial method; donor splice site; gene; genesplicer; machine learning; maximum entropy sensor; oligonucleotide frequency counting; receiver operating characteristic curve; splice sites prediction; splicing simulator splicescan; Bayesian methods; Bioinformatics; Biosensors; Computer science; Entropy; Frequency; Sensitivity; Sensor phenomena and characterization; Sensor systems; Splicing;
Conference_Titel :
Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
Print_ISBN :
0-7695-2442-7
DOI :
10.1109/CSBW.2005.40