Title :
A Study of Supervised Learning with Multivariate Analysis on Unbalanced Datasets
Author :
Ou, Yu-Yen ; Hung, Hao-Geng ; Oyang, Yen-Jen
Abstract :
How to handle unbalanced datasets and how to handle high-dimensional datasets are two of the most challenging issues faced by the latest machine learning research. This article reports a study aimed at providing effective solutions to these two challenges. For handling unbalanced datasets, we proposed that a different value of the cost parameter in Support Vector Machine (SVM) is employed for each class of samples. For handling high-dimensional datasets, we resorted to Independent Components Analysis (ICA), which is a multivariate analysis algorithm, along with the conventional univariate analysis. Fxperimental results confirmed that the proposed approaches all together significantly improved the prediction accuracy delivered by SVM.
Keywords :
data handling; independent component analysis; learning (artificial intelligence); support vector machines; high-dimensional datasets; independent components analysis; machine learning; multivariate analysis; supervised learning; support vector machine; unbalanced datasets; Algorithm design and analysis; Classification algorithms; Costs; Data analysis; Feature extraction; Independent component analysis; Machine learning; Supervised learning; Support vector machine classification; Support vector machines;
Conference_Titel :
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9490-9
DOI :
10.1109/IJCNN.2006.247014