• DocumentCode
    3314839
  • Title

    Improved Decision Tree Method for Imbalanced Data Sets in Digital Forensics

  • Author

    Liu Qin

  • Author_Institution
    Dept. of Inf. Sci. & Technol., East China Univ. of Political Sci. & Law, Shanghai, China
  • fYear
    2012
  • fDate
    17-19 Aug. 2012
  • Firstpage
    251
  • Lastpage
    254
  • Abstract
    Improved decision tree ID3 algorithm for suiting digital forensics is presented in the study. Forensics data are imbalanced, inconstant, noisy and dispersive. Based on these characteristic, we improve ID3 algorithm by adopting correction factor and two times information gain, which can avoid the large data bias of ID3 algorithm. The experimental results show that the improved algorithm has good simplicity and low error rate compared with ID3. It can be seen that the improved method used in the digital forensics process is entirely feasible.
  • Keywords
    computer forensics; decision trees; ID3 algorithm; decision tree method; digital forensics; error rate; imbalanced data sets; Accuracy; Algorithm design and analysis; Classification algorithms; Computers; Decision trees; Digital forensics; Information entropy; Decision Tree; Digital forensics; ID3; Imbalanced data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational and Information Sciences (ICCIS), 2012 Fourth International Conference on
  • Conference_Location
    Chongqing
  • Print_ISBN
    978-1-4673-2406-9
  • Type

    conf

  • DOI
    10.1109/ICCIS.2012.171
  • Filename
    6300460