• DocumentCode
    3714432
  • Title

    Identification of DNA-binding proteins by auto-cross covariance transformation

  • Author

    Qiwen Dong; Shanyi Wang; Kai Wang; Xuan Liu;Bin Liu

  • Author_Institution
    School of Computer Science, Fudan University, Shanghai 200433, China
  • fYear
    2015
  • Firstpage
    470
  • Lastpage
    475
  • Abstract
    DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next generation of sequencing technique, the number of protein sequences are unprecedentedly increasing. Thus it is necessary to develop computational methods to identify the DNA-binding protein from the protein sequence information. In this study, a novel method is presented which combines the support vector machine and the auto-cross covariance transformation. The protein sequence represented in the form of amino acids or the physical-chemical properties of amino acids are converted into a series of fixed-length vectors by Kmer composition and the auto-cross covariance transformation. The sequence order effect can be effectively capture by this scheme. These vectors are then inputted to support vector machine to discriminate the DNA-binding proteins from the non DNA-binding ones. The proposed method achieves the overall accuracy of 75.23% and Matthew correlation coefficient of 0.5 by a rigorous jackknife test. The independent test shows that the proposed method outperforms most of the existing methods. These results demonstrate that the proposed method provides the state-of-the-art performance for the prediction of DNA-binding proteins.
  • Keywords
    "Proteins","DNA"
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/BIBM.2015.7359730
  • Filename
    7359730