DocumentCode
2771281
Title
A Study of Supervised Learning with Multivariate Analysis on Unbalanced Datasets
Author
Ou, Yu-Yen ; Hung, Hao-Geng ; Oyang, Yen-Jen
fYear
0
fDate
0-0 0
Firstpage
2201
Lastpage
2205
Abstract
How to handle unbalanced datasets and how to handle high-dimensional datasets are two of the most challenging issues faced by the latest machine learning research. This article reports a study aimed at providing effective solutions to these two challenges. For handling unbalanced datasets, we proposed that a different value of the cost parameter in Support Vector Machine (SVM) is employed for each class of samples. For handling high-dimensional datasets, we resorted to Independent Components Analysis (ICA), which is a multivariate analysis algorithm, along with the conventional univariate analysis. Fxperimental results confirmed that the proposed approaches all together significantly improved the prediction accuracy delivered by SVM.
Keywords
data handling; independent component analysis; learning (artificial intelligence); support vector machines; high-dimensional datasets; independent components analysis; machine learning; multivariate analysis; supervised learning; support vector machine; unbalanced datasets; Algorithm design and analysis; Classification algorithms; Costs; Data analysis; Feature extraction; Independent component analysis; Machine learning; Supervised learning; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location
Vancouver, BC
Print_ISBN
0-7803-9490-9
Type
conf
DOI
10.1109/IJCNN.2006.247014
Filename
1716384
Link To Document