Title :
A fuzzy-based instance selection approach for data mining
Author :
Wright, Peggy ; Hodges, Julia
Author_Institution :
Eng. R&D Center, US Army Corps of Eng., Vicksburg, MS, USA
Abstract :
Data mining is an area that is enjoying increasing growth. One of the most time-consuming tasks in data mining is data preparation or pre-processing. Since data pre-processing takes more time and effort than the rest of the data mining process, the need for improved data pre-processing methods is well recognized. Dealing with missing values can further complicate data pre-processing. Several methods have been used to resolve the problem of instance selection when there are missing data values. Most of these methods: 1) discard records with missing values; 2) use all records and ignore missing values; or 3) use all records and infer missing values. These methods do not consider the utility of individual attributes. Here, we introduce a fuzzy-based information metric that considers the usefulness of the individual attributes by incorporating domain knowledge into a multicriteria decision-making instance selection technique
Keywords :
data mining; data preparation; fuzzy set theory; knowledge based systems; data mining; data pre-processing; data preparation; fuzzy set theory; knowledge based system; multicriteria decision-making; Computer science; Data mining; Data preprocessing; Open wireless architecture; Research and development; Uncertainty;
Conference_Titel :
Fuzzy Systems, 2000. FUZZ IEEE 2000. The Ninth IEEE International Conference on
Conference_Location :
San Antonio, TX
Print_ISBN :
0-7803-5877-5
DOI :
10.1109/FUZZY.2000.838690