DocumentCode :
3106729
Title :
The Influence of Class Imbalance on Cost-Sensitive Learning: An Empirical Study
Author :
Liu, Xu-Ying ; Zhou, Zhi-Hua
Author_Institution :
Nat. Lab. for Novel Software Technol., Nanjing Univ., Nanjing
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
970
Lastpage :
974
Abstract :
In real-world applications the number of examples in one class may overwhelm the other class, but the primary interest is usually on the minor class. Cost-sensitive learning has been deeded as a good solution to these class-imbalanced tasks, yet it is not clear how does the class-imbalance affect cost-sensitive classifiers. This paper presents an empirical study using 38 data sets, which discloses that class-imbalance often affects the performance of cost-sensitive classifiers: When the misclassification costs are not seriously unequal, cost-sensitive classifiers generally favor natural class distribution although it might be imbalanced; while when misclassification costs are seriously unequal, a balanced class distribution is more favorable.
Keywords :
learning (artificial intelligence); pattern classification; class imbalance task; cost-sensitive classifier; cost-sensitive learning; misclassification cost; natural class distribution; Application software; Biomedical monitoring; Cost function; Data mining; Decision trees; Design methodology; Intrusion detection; Laboratories; Learning systems; Medical diagnosis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.158
Filename :
4053137
Link To Document :
بازگشت