• DocumentCode
    2771281
  • Title

    A Study of Supervised Learning with Multivariate Analysis on Unbalanced Datasets

  • Author

    Ou, Yu-Yen ; Hung, Hao-Geng ; Oyang, Yen-Jen

  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    2201
  • Lastpage
    2205
  • Abstract
    How to handle unbalanced datasets and how to handle high-dimensional datasets are two of the most challenging issues faced by the latest machine learning research. This article reports a study aimed at providing effective solutions to these two challenges. For handling unbalanced datasets, we proposed that a different value of the cost parameter in Support Vector Machine (SVM) is employed for each class of samples. For handling high-dimensional datasets, we resorted to Independent Components Analysis (ICA), which is a multivariate analysis algorithm, along with the conventional univariate analysis. Fxperimental results confirmed that the proposed approaches all together significantly improved the prediction accuracy delivered by SVM.
  • Keywords
    data handling; independent component analysis; learning (artificial intelligence); support vector machines; high-dimensional datasets; independent components analysis; machine learning; multivariate analysis; supervised learning; support vector machine; unbalanced datasets; Algorithm design and analysis; Classification algorithms; Costs; Data analysis; Feature extraction; Independent component analysis; Machine learning; Supervised learning; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2006. IJCNN '06. International Joint Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    0-7803-9490-9
  • Type

    conf

  • DOI
    10.1109/IJCNN.2006.247014
  • Filename
    1716384