• DocumentCode
    567533
  • Title

    Choice of dimensionality reduction methods for feature and classifier fusion with nearest neighbor classifiers

  • Author

    Deegalla, Sampath ; Boström, Henrik ; Walgama, Keerthi

  • Author_Institution
    Dept. of Comput. & Syst. Sci., Stockholm Univ., Kista, Sweden
  • fYear
    2012
  • fDate
    9-12 July 2012
  • Firstpage
    875
  • Lastpage
    881
  • Abstract
    Often high dimensional data cause problems for currently used learning algorithms in terms of efficiency and effectiveness. One solution for this problem is to apply dimensionality reduction by which the original feature set could be reduced to a small number of features while gaining improved accuracy and/or efficiency of the learning algorithm. We have investigated multiple dimensionality reduction methods for nearest neighbor classification in high dimensions. In previous studies, we have demonstrated that fusion of different outputs of dimensionality reduction methods, either by combining classifiers built on reduced features, or by combining reduced features and then applying the classifier, may yield higher accuracies than when using individual reduction methods. However, none of the previous studies have investigated what dimensionality reduction methods to choose for fusion, when outputs of multiple dimensionality reduction methods are available. Therefore, we have empirically investigated different combinations of the output of four dimensionality reduction methods on 18 medicinal chemistry datasets. The empirical investigation demonstrates that fusion of nearest neighbor classifiers obtained from multiple reduction methods in all cases outperforms the use of individual dimensionality reduction methods, while fusion of different feature subsets is quite sensitive to the choice of dimensionality reduction methods.
  • Keywords
    learning systems; sensor fusion; signal classification; classifier fusion; dimensionality reduction methods; feature fusion; high dimensional data; individual reduction methods; learning algorithms; nearest neighbor classification; nearest neighbor classifiers; Accuracy; Covariance matrix; Educational institutions; Matrix decomposition; Principal component analysis; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Fusion (FUSION), 2012 15th International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4673-0417-7
  • Electronic_ISBN
    978-0-9824438-4-2
  • Type

    conf

  • Filename
    6289894