• DocumentCode
    3731026
  • Title

    Attribute Reduction of gene signal based on Improved OTSU discretization method

  • Author

    Yun Liu; Tao Hou; Ke Wang; Fu Liu

  • Author_Institution
    College of Communications Engineering, Jilin University, Changchun, China
  • fYear
    2015
  • Firstpage
    983
  • Lastpage
    987
  • Abstract
    k-mer frequency has been widely used as digital features of DNA fragments in microbial DNA recognition. However, to achieve ideal identification accuracy, it often needs to extract a nearly ten thousand-dimensional vector from DNA fragments as species labels. The high dimension of the feature vector will lead to excessive calculation loss. Rough set theory is a good method for attitude reduction but can only deal with discrete data, so a new OTSU discretization method is presented in this paper. Experiments on 30 microbial strains signals and 6 UCI datasets were carried out and the results show that using rough set theory can get less feature dimension and higher classification accuracy after discretization by this method. The number of features can be reduced by 69.53%, with 6.28% higher accuracy achieved and the operation time can be reduced by 78.38%.
  • Keywords
    "Microorganisms","DNA","Classification algorithms","Genomics","Bioinformatics","Testing","Feature extraction"
  • Publisher
    ieee
  • Conference_Titel
    Chinese Automation Congress (CAC), 2015
  • Type

    conf

  • DOI
    10.1109/CAC.2015.7382641
  • Filename
    7382641