Title :
Knowledge measurement based on Rough Set
Author :
Yan Xu ; Bin, Wang
Author_Institution :
Beijing Language & Culture Univ., Beijing, China
Abstract :
In this paper, a method is proposed to measure attribute´s importance based on rough set theory. According to rough set theory, knowledge about a universe of objects may be defined as classifications based on certain properties of the objects, i.e. rough set theory assume that knowledge is an ability to partition objects. We quantify the ability of classify objects, and call the amount of this ability as knowledge quantity. The more knowledge quantity the attributes have, the more important they are in the information system. In addition, automatic feature selection methods such as document frequency thresholding (DF), is commonly applied in text categorization, but DF method does not have an academic interpretation, it is usually considered an empirical approach to improve efficiency. We put forward an interpretation of DF based on knowledge quantity.
Keywords :
document handling; knowledge engineering; rough set theory; document frequency thresholding; knowledge measurement; knowledge quantity; rough set theory; text categorization; Frequency; Gain measurement; Information systems; Mutual information; Natural languages; Performance gain; Power measurement; Set theory; Statistics; Text categorization;
Conference_Titel :
Granular Computing, 2009, GRC '09. IEEE International Conference on
Conference_Location :
Nanchang
Print_ISBN :
978-1-4244-4830-2
DOI :
10.1109/GRC.2009.5255042