• DocumentCode
    1415314
  • Title

    Extending Attribute Information for Small Data Set Classification

  • Author

    Li, Der-Chiang ; Liu, Chiao-Wen

  • Author_Institution
    Dept. of Ind. & Inf. Manage., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    24
  • Issue
    3
  • fYear
    2012
  • fDate
    3/1/2012 12:00:00 AM
  • Firstpage
    452
  • Lastpage
    464
  • Abstract
    Data quantity is the main issue in the small data set problem, because usually insufficient data will not lead to a robust classification performance. How to extract more effective information from a small data set is thus of considerable interest. This paper proposes a new attribute construction approach which converts the original data attributes into a higher dimensional feature space to extract more attribute information by a similarity-based algorithm using the classification-oriented fuzzy membership function. Seven data sets with different attribute sizes are employed to examine the performance of the proposed method. The results show that the proposed method has a superior classification performance when compared to principal component analysis (PCA), kernel principal component analysis (KPCA), and kernel independent component analysis (KICA) with a Gaussian kernel in the support vector machine (SVM) classifier.
  • Keywords
    Gaussian processes; data handling; fuzzy set theory; information retrieval; pattern classification; principal component analysis; support vector machines; Gaussian kernel; attribute construction approach; attribute information; attribute information extraction; classification-oriented fuzzy membership function; data attributes; data quantity; feature space; kernel independent component analysis; kernel principal component analysis; principal component analysis; similarity-based algorithm; small data set classification; support vector machine classifier; Accuracy; Artificial neural networks; Classification algorithms; Feature extraction; Kernel; Principal component analysis; Support vector machines; Classification; feature construction; small data set; support vector machine.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2010.254
  • Filename
    5677515