DocumentCode
3260171
Title
An Improved Feature Representation Method for Maximum Entropy Model
Author
Yi, Guan ; Jian, Zhao
Author_Institution
Harbin Inst. of Technol.
fYear
2006
fDate
Dec. 2006
Firstpage
400
Lastpage
406
Abstract
In maximum entropy model (MEM), features are typically represented by either 0-1 binary-valued function or real-valued function. However, both representations only examine the impact of specific value of some attributes but not their types. Such negligence not only causes the decreasing of classification precision, but also slows the convergence speed of the generalized iterative scaling (GIS) algorithm, as more apparent to incomplete data. In this paper, an improved feature representation method is presented. The feature is composed of two parts: the first one is for specific value of an attribute; the second one is for the type of corresponding attribute. The experimental results on Mushroom dataset of UCI data repository showed that the average classifying precisions on incomplete dataset and complete dataset were improved by 1.5% and 3.0% respectively, and the average convergence speed was improved by 42.9% and 90.7% respectively
Keywords
knowledge representation; maximum entropy methods; pattern classification; Mushroom dataset; UCI data repository; corresponding attribute; feature representation; incomplete data; maximum entropy model; specific value attribute; Bayesian methods; Convergence; Data analysis; Data mining; Entropy; Geographic Information Systems; Internet; Iterative algorithms; Maximum likelihood estimation; Statistical analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location
Hong Kong
Print_ISBN
0-7695-2702-7
Type
conf
DOI
10.1109/ICDMW.2006.29
Filename
4063660
Link To Document