Title :
Multi-label Classification: Dealing with Imbalance by Combining Labels
Author :
Ming Fang ; Yuqi Xiao ; Chongjun Wang ; Junyuan Xie
Author_Institution :
Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
Abstract :
Data imbalance is a common problem both in single-label classification (SLC) and multi-label classification (MLC). There is no doubt that the predicting result suffers from this problem. Although, a broad range of studies associate with imbalance problem, most of them focus on SLC and for MLC is relatively less. Actually, this problem arising in MLCis more frequent and complex than in SLC. In this paper, we proceed from dealing with imbalance problem for MLC and propose a new approach called DEML. DEML transforms the whole label set of multi-label dataset into some subsets and each subset is treated as a multi-class dataset with balanced class distribution, which not only addressing imbalance problem but also preserving dataset integrity and consistency. Extensive experiments show that DEML possesses highly competitive performance both in computation and effectiveness.
Keywords :
pattern classification; DEML approach; MLC; SLC; class distribution; data imbalance; dataset consistency; dataset integrity; multiclass dataset; multilabel classification; single-label classification; Benchmark testing; Complexity theory; Conferences; Data mining; Entropy; Training; Transforms; combining labels; imbalance; multi-label classification;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2014 IEEE 26th International Conference on
Conference_Location :
Limassol
DOI :
10.1109/ICTAI.2014.42