DocumentCode
2135257
Title
A new algorithm for predicting protein coding regions based on the hybird threshold
Author
Yu-Tao Ma ; Jin Che ; Xiao-Guang Lu ; Jian-Fu Teng ; Li-Yi Zhang
Author_Institution
Sch. of Phys. Electr. Inf. Eng., Ningxia Univ., Yinchuan, China
fYear
2012
fDate
16-18 Oct. 2012
Firstpage
846
Lastpage
849
Abstract
In protein coding regions prediction works, the predicted non-coding percentile (PNCP) is usually used as the threshold to separate the DNA sequences into two kinds of regions, i.e. protein coding regions and non-coding regions. In this paper, a new protein coding regions prediction algorithm based on the hybrid threshold is presented. First, the normalized power spectral density (PSD) of a DNA sequence is calculated using the prediction algorithm based on narrow pass-band filters (NPBF), and the maximum PSD value of the DNA sequence is used as the normalization standard in the NPBF algorithm. Second, the coding regions´ PSD curve is set up as a trapezoid model characterized by its slope and height. Third, the method for calculating the hybrid threshold which taking the slope and the height both into consideration is presented. Finally, the algorithm is performed on DNA dataset HMR195. Using the approximate correlation (AC) as the evaluation measure of prediction accuracy, the prediction results of the proposed algorithm reach to 0.48 for the dataset, which is much better than the modified Gabor wavelet transform algorithm. The prediction results are also presented in the form of q9 proposed by Chun-Ting Zhang in 2002.
Keywords
DNA; Gabor filters; band-pass filters; biology computing; encoding; molecular biophysics; molecular configurations; proteins; wavelet transforms; DNA dataset HMR195; DNA sequences; approximate correlation; hybrid threshold; modified Gabor wavelet transform algorithm; narrow pass-band filters; noncoding percentile; normalization standard; normalized power spectral density; prediction accuracy; protein coding regions; trapezoid model; narrow pass-band filter; power spectral density curve; prediction algorithm; protein coding regions; threshold;
fLanguage
English
Publisher
ieee
Conference_Titel
Biomedical Engineering and Informatics (BMEI), 2012 5th International Conference on
Conference_Location
Chongqing
Print_ISBN
978-1-4673-1183-0
Type
conf
DOI
10.1109/BMEI.2012.6513065
Filename
6513065
Link To Document