Title :
Data Mining Based Fuzzy Classification Algorithm for Imbalanced Data
Author :
Xu, Le ; Chow, Mo-Yuen ; Taylor, Leroy S.
Author_Institution :
North Carolina State Univ., Raleigh
Abstract :
The elegant fuzzy classification algorithm proposed by Ishibuchi et al. (I-algorithm) has achieved satisfactory performance on many well-known test data sets that have usually been carefully preprocessed. However, the algorithm does not provide satisfactory performance for the problems with imbalanced data that are often encountered in real-world applications. This paper presents an extension of the I-algorithm to E-algorithm to alleviate the effect of data imbalance. Both the I-algorithm and the E-algorithm are applied to Duke Energy outage data for power distribution systems fault cause identification. Their performance on this real-world imbalanced data set is presented, compared, and analyzed to demonstrate the improvement of the extended algorithm.
Keywords :
data mining; fuzzy set theory; pattern classification; power distribution faults; power engineering computing; Duke Energy outage data; E-algorithm; I-algorithm; data imbalance; data mining; fuzzy classification algorithm; fuzzy sets; power distribution systems fault cause identification; Application software; Classification algorithms; Data mining; Fault diagnosis; Fuzzy reasoning; Fuzzy sets; Fuzzy systems; Performance analysis; Power distribution; Testing;
Conference_Titel :
Fuzzy Systems, 2006 IEEE International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9488-7
DOI :
10.1109/FUZZY.2006.1681806