DocumentCode
3109672
Title
A Modified Chi2 Algorithm Based on the Significance of Attribute
Author
Zhang, Hao ; Miao, Duoqian ; Wang, Ruizhi
Author_Institution
Dept. of Comput. Sci. & Technol., Tongji Univ., Shanghai
fYear
2006
fDate
Dec. 2006
Firstpage
490
Lastpage
493
Abstract
Discretization is one of the important components of the data preprocessing. Discretization can turn numeric attributes into discrete ones. There are many different kinds of discretization methods. This paper describes the Chi2 algorithm which is a simple and general discretization algorithm. In this algorithm, the chi2 statistic value is used as an evaluative standard to discretize the numeric attributes. However, the Chi2 algorithm dose not consider the sequence of discretization for each attribute in the second phase. And the inconsistency rate cannot fully reflect the characteristic of dataset. These drawbacks will affect the result of discretization finally. In this paper, some concepts of the rough set are introduced to improve the Chi2 algorithm
Keywords
knowledge acquisition; pattern classification; rough set theory; statistical analysis; Chi2 algorithm; data preprocessing; discretization methods; rough set; Chaos; Computer science; Data preprocessing; Intelligent agent; Merging; Space technology; Statistics; Testing; Training data; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology Workshops, 2006. WI-IAT 2006 Workshops. 2006 IEEE/WIC/ACM International Conference on
Conference_Location
Hong Kong
Print_ISBN
0-7695-2749-3
Type
conf
DOI
10.1109/WI-IATW.2006.13
Filename
4053299
Link To Document