• DocumentCode
    19244
  • Title

    An Adaptive Window Length Strategy for Eukaryotic CDS Prediction

  • Author

    Shakya, Devendra Kumar ; Saxena, R. ; Sharma, Sanjeev Narayan

  • Author_Institution
    Dept. of Biomed. Eng., Samrat Ashok Technol. Inst., Vidisha, India
  • Volume
    10
  • Issue
    5
  • fYear
    2013
  • fDate
    Sept.-Oct. 2013
  • Firstpage
    1241
  • Lastpage
    1252
  • Abstract
    Signal processing-based algorithms for identification of coding sequences (CDS) in eukaryotes are non-data driven and exploit the presence of three-base periodicity in these regions for their detection. Three-base periodicity is commonly detected using short time Fourier transform (STFT) that uses a window function of fixed length. As the length of the protein coding and noncoding regions varies widely, the identification accuracy of STFT-based algorithms is poor. In this paper, a novel signal processing-based algorithm is developed by enabling the window length adaptation in STFT of DNA sequences for improving the identification of three-base periodicity. The length of the window function has been made adaptive in coding regions to maximize the magnitude of period-3 measure, whereas in the noncoding regions, the window length is tailored to minimize this measure. Simulation results on bench mark data sets demonstrate the advantage of this algorithm when compared with other non-data-driven methods for CDS prediction.
  • Keywords
    DNA; bioinformatics; molecular biophysics; molecular configurations; proteins; signal processing; DNA sequences; STFT; adaptive window length strategy; coding sequences; eukaryotic CDS prediction; identification accuracy; protein coding regions; protein noncoding regions; short time Fourier transform; signal processing-based algorithms; three-base periodicity; Bioinformatics; DNA; Encoding; Genomics; Prediction algorithms; Signal processing algorithms; Bioinformatics; deoxyribonucleic acid (DNA); short time Fourier transform (STFT); signal processing; window function;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2013.76
  • Filename
    6552192