• DocumentCode
    527363
  • Title

    A new instance selection algorithm based on contribution for nearest neighbour classification

  • Author

    Cai, Yong-hua ; Wu, Bo ; He, Yu-Lin ; Zhang, Ye

  • Author_Institution
    Dept. of Math. & Comput. Sci., Hebei Normal Univ. for Nat., Chengde, China
  • Volume
    1
  • fYear
    2010
  • fDate
    11-14 July 2010
  • Firstpage
    155
  • Lastpage
    160
  • Abstract
    Nearest Neighbor Classifier is one of the most classical lazy learning schemes. The basic nearest neighbor classifiers suffer from the common problem that the instances used to train the classifier are all stored indiscriminately, and as a result, the required memory storage is huge and response time becomes slow with a large database. In this paper, a new Instances Selection algorithm based on Classification Contribution Function shortly named ISCC is presented. In this algorithm, a function is introduced to evaluate the classification ability of the instances. For each instance, the function considers its contribution to the neighbor instances not only with same class but also with different class. Then an instance with the highest value of Classification Contribution Function is added to the condensed subset and the instances which can be classified correctly are deleted in each iteration. This process is repeated until the subset is no longer getting larger. The time complexity of ISCC is O (in2). The experimental results on two artificial databases and some real databases demonstrate the effectiveness and the feasibility of the proposed algorithm. Compared to traditional methods, such as MCS, ICF and ENN, the condensed sets obtained by ISCC is superior in storage and classification accuracy.
  • Keywords
    computational complexity; learning (artificial intelligence); pattern classification; classification contribution function; instance selection algorithm; memory storage requirement; nearest neighbour classification; time complexity; Accuracy; Classification algorithms; Learning; Machine learning; Nearest neighbor searches; Noise; Training; ENN; ICF; ISC; Instance Selection; MCS; Nearest Neighbour Rule; Noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-1-4244-6526-2
  • Type

    conf

  • DOI
    10.1109/ICMLC.2010.5581074
  • Filename
    5581074