Title :
Imbalanced Sentiment Classification with Multi-strategy Ensemble Learning
Author :
Wang, Zhongqing ; Li, Shoushan ; Zhou, Guodong ; Li, Peifeng ; Zhu, Qiaoming
Author_Institution :
Natural Language Process. Lab., Soochow Univ., Suzhou, China
Abstract :
Recently, sentiment classification has become a hot research topic in natural language processing. However, most existing studies assume that the samples in the negative and positive categories are balanced, which might not be true in real applications. In this paper, we investigate sentiment classification tasks where the class distribution of the samples is imbalanced. To handle the imbalance, we propose a multi-strategy ensemble learning approach. Our ensemble approach integrates sample-ensemble, feature-ensemble, and classifier-ensemble by exploiting multiple classification algorithms. Evaluation across four domains shows that our ensemble approach outperforms many other popular approaches for handling imbalanced classification problems, such as re-sampling and cost-sensitive approaches, and is proven effective for imbalanced sentiment classification.
Keywords :
learning (artificial intelligence); natural language processing; pattern classification; text analysis; classifier-ensemble; cost-sensitive approach; feature-ensemble; imbalanced classification problem handling; imbalanced sentiment classification; multistrategy ensemble learning approach; Classification algorithms; Entropy; Learning systems; Semantics; Thumb; Training; Training data; ensemble learning; imbalanced classification; sentiment classification
Conference_Title :
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1733-8
DOI :
10.1109/IALP.2011.28