DocumentCode :
3260171
Title :
An Improved Feature Representation Method for Maximum Entropy Model
Author :
Yi, Guan ; Jian, Zhao
Author_Institution :
Harbin Inst. of Technol.
fYear :
2006
fDate :
Dec. 2006
Firstpage :
400
Lastpage :
406
Abstract :
In maximum entropy model (MEM), features are typically represented by either 0-1 binary-valued function or real-valued function. However, both representations only examine the impact of specific value of some attributes but not their types. Such negligence not only causes the decreasing of classification precision, but also slows the convergence speed of the generalized iterative scaling (GIS) algorithm, as more apparent to incomplete data. In this paper, an improved feature representation method is presented. The feature is composed of two parts: the first one is for specific value of an attribute; the second one is for the type of corresponding attribute. The experimental results on Mushroom dataset of UCI data repository showed that the average classifying precisions on incomplete dataset and complete dataset were improved by 1.5% and 3.0% respectively, and the average convergence speed was improved by 42.9% and 90.7% respectively
Keywords :
knowledge representation; maximum entropy methods; pattern classification; Mushroom dataset; UCI data repository; corresponding attribute; feature representation; incomplete data; maximum entropy model; specific value attribute; Bayesian methods; Convergence; Data analysis; Data mining; Entropy; Geographic Information Systems; Internet; Iterative algorithms; Maximum likelihood estimation; Statistical analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
Type :
conf
DOI :
10.1109/ICDMW.2006.29
Filename :
4063660
Link To Document :
بازگشت