DocumentCode :
2691744
Title :
Time and Frequency Domain Methods for Gene and Exon Prediction in Eukaryotes
Author :
Akhtar, Maria ; Epps, Julien ; Ambikairajah, E.
Author_Institution :
New South Wales Univ., Sydney, NSW, Australia
Volume :
2
fYear :
2007
fDate :
15-20 April 2007
Abstract :
The detection of period-3 components in exons of eukaryotic gene sequences enables signal processing based time-domain and frequency-domain methods to predict these regions. In this paper, we improve the prediction accuracy of frequency-domain methods by proposing a new algorithm known as the paired and weighted spectral rotation (PWSR) measure, which exploits both period-3 behaviour and another useful statistical property of genomic sequences. By comparison with existing frequency-domain approaches, the proposed PWSR method reveals relative improvements of 15.2% and 10.7% respectively over spectral content and spectral rotation measures in terms of prediction accuracy of exonic nucleotides at a 10% false positive rate using the GENSCAN test set. Finally, we combine the proposed PWSR with an existing time-domain method to demonstrate further signal processing-based improvements in gene and exon prediction accuracy.
Keywords :
biology computing; frequency-domain analysis; genetics; molecular biophysics; statistical analysis; time-domain analysis; eukaryotic gene sequences; exon prediction; exonic nucleotides; frequency domain methods; gene prediction; genomic sequences; paired and weighted spectral rotation; period-3 behaviour; signal processing; time domain methods; Accuracy; Bioinformatics; Frequency domain analysis; Frequency measurement; Genomics; Nuclear measurements; Rotation measurement; Signal processing algorithms; Testing; Time domain analysis; Correlation; DNA; Discrete Fourier transforms; Signal processing; Time-frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366300
Filename :
4217473
Link To Document :
بازگشت