DocumentCode :
2334126
Title :
Classification with degree of membership: a fuzzy approach
Author :
Au, Wai-Ho ; Chan, Keith C C
Author_Institution :
Dept. of Comput., Hong Kong Polytech. Univ., Kowloon, China
fYear :
2001
fDate :
2001
Firstpage :
35
Lastpage :
42
Abstract :
Classification is an important topic in data mining research. It is concerned with predicting the values of some attribute in a database from the values of other attributes. To tackle this problem, most existing data mining algorithms adopt either a decision-tree-based approach or an approach that requires user-specified thresholds to guide the search for interesting rules. The authors propose a new approach that uses an objective interestingness measure to distinguish interesting rules from uninteresting ones. Because it represents the revealed regularities and exceptions in linguistic terms, which are close to the way humans represent knowledge, the approach is especially useful when the discovered rules are presented to human experts for examination. The use of a fuzzy technique allows each predicted attribute value to be associated with a degree of membership, so the approach can handle cases where an object belongs to more than one class. The fuzzy technique also makes the approach more resilient to noise and missing data values. To evaluate its performance, the authors tested the approach on several real-life databases. The experimental results show that it can be very effective at data mining tasks. Compared with popular data mining algorithms, the approach is better able to uncover useful rules hidden in databases.
Keywords :
computational linguistics; data mining; fuzzy set theory; pattern classification; very large databases; attribute values; classification; data mining algorithms; data mining tasks; decision tree based approach; degree of membership; discovered rules; fuzzy approach; fuzzy technique; human experts; human knowledge representation; interesting rules; linguistic terms; objective interestingness measure; real-life databases; uninteresting rules; Data mining; Databases; Decision trees; Electronic mail; Gold; Humans; Knowledge representation; Marketing management; Production; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
Type :
conf
DOI :
10.1109/ICDM.2001.989498
Filename :
989498