Title :
A rough set based minority class oriented learning algorithm for highly unbalanced data sets
Author :
Ye, Dongyi ; Chen, Zhaojiong
Author_Institution :
Coll. of Math. & Comput., Fuzhou Univ., Fuzhou
Abstract :
Highly unbalanced data sets occur frequently in many practical applications and quite often the class of interest in such data sets is just a minority class. Like most standard machine learning methods, traditional rough sets based rule learning algorithms do not usually work well on highly unbalanced data sets. In this paper, we present a minority class rule learning algorithm for a highly unbalanced inconsistent data set where the class of interest is the minority one. The proposed algorithm pivots on discovery of the main features that discriminate the minority class from majority classes by finding the so called dominant minority subset. An illustrative example and a real application to customer churning prediction in Telecom are given to show the effectiveness of the proposed algorithm.
Keywords :
data mining; learning (artificial intelligence); rough set theory; Telecom; customer churning prediction; dominant minority subset; highly unbalanced data sets; machine learning methods; minority class oriented learning algorithm; rough set; rule learning algorithms; Application software; Data mining; Educational institutions; Learning systems; Machine learning; Machine learning algorithms; Mathematics; Rough sets; Set theory; Telecommunications;
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
DOI :
10.1109/GRC.2008.4664705