DocumentCode :
3261923
Title :
A rough set based minority class oriented learning algorithm for highly unbalanced data sets
Author :
Ye, Dongyi ; Chen, Zhaojiong
Author_Institution :
Coll. of Math. & Comput., Fuzhou Univ., Fuzhou
fYear :
2008
fDate :
26-28 Aug. 2008
Firstpage :
736
Lastpage :
739
Abstract :
Highly unbalanced data sets occur frequently in many practical applications and quite often the class of interest in such data sets is just a minority class. Like most standard machine learning methods, traditional rough sets based rule learning algorithms do not usually work well on highly unbalanced data sets. In this paper, we present a minority class rule learning algorithm for a highly unbalanced inconsistent data set where the class of interest is the minority one. The proposed algorithm pivots on discovery of the main features that discriminate the minority class from majority classes by finding the so called dominant minority subset. An illustrative example and a real application to customer churning prediction in Telecom are given to show the effectiveness of the proposed algorithm.
Keywords :
data mining; learning (artificial intelligence); rough set theory; Telecom; customer churning prediction; dominant minority subset; highly unbalanced data sets; machine learning methods; minority class oriented learning algorithm; rough set; rule learning algorithms; Application software; Data mining; Educational institutions; Learning systems; Machine learning; Machine learning algorithms; Mathematics; Rough sets; Set theory; Telecommunications;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
Type :
conf
DOI :
10.1109/GRC.2008.4664705
Filename :
4664705
Link To Document :
بازگشت