DocumentCode
3314839
Title
Improved Decision Tree Method for Imbalanced Data Sets in Digital Forensics
Author
Liu Qin
Author_Institution
Dept. of Inf. Sci. & Technol., East China Univ. of Political Sci. & Law, Shanghai, China
fYear
2012
fDate
17-19 Aug. 2012
Firstpage
251
Lastpage
254
Abstract
Improved decision tree ID3 algorithm for suiting digital forensics is presented in the study. Forensics data are imbalanced, inconstant, noisy and dispersive. Based on these characteristic, we improve ID3 algorithm by adopting correction factor and two times information gain, which can avoid the large data bias of ID3 algorithm. The experimental results show that the improved algorithm has good simplicity and low error rate compared with ID3. It can be seen that the improved method used in the digital forensics process is entirely feasible.
Keywords
computer forensics; decision trees; ID3 algorithm; decision tree method; digital forensics; error rate; imbalanced data sets; Accuracy; Algorithm design and analysis; Classification algorithms; Computers; Decision trees; Digital forensics; Information entropy; Decision Tree; Digital forensics; ID3; Imbalanced data;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational and Information Sciences (ICCIS), 2012 Fourth International Conference on
Conference_Location
Chongqing
Print_ISBN
978-1-4673-2406-9
Type
conf
DOI
10.1109/ICCIS.2012.171
Filename
6300460
Link To Document