• DocumentCode
    259585
  • Title

    Adding Diversity to Rank Examples in Anytime Nearest Neighbor Classification

  • Author

    Lemes, Cristiano Inacio ; Silva, Diego Furtado ; Batista, Gustavo E. A. P. A.

  • Author_Institution
    Inst. de Cinencias Mat. e de Comput., Univ. de Sao Paulo, Sao Paulo, Brazil
  • fYear
    2014
  • fDate
    3-6 Dec. 2014
  • Firstpage
    129
  • Lastpage
    134
  • Abstract
    In the last decade we have witnessed a huge increase of interest in data stream learning algorithms. A stream is an ordered sequence of data records. It is characterized by properties such as the potentially infinite and rapid flow of instances. However, a property that is common to various application domains and is frequently disregarded is the very high fluctuating data rates. In domains with fluctuating data rates, the events do not occur with a fixed frequency. This imposes an additional challenge for the classifiers since the next event can occur at any time after the previous one. Anytime classification provides a very convenient approach for fluctuating data rates. In summary, an anytime classifier can be interrupted at any time before its completion and still be able to provide an intermediate solution. The popular k-nearest neighbor (k-NN) classifier can be easily made anytime by introducing a ranking of the training examples. A classification is achieved by scanning the training examples according to this ranking. In this paper, we show how the current state-of-the-art k-NN anytime classifier can be made more accurate by introducing diversity in the training set ranking. Our results show that, with this simple modification, the performance of the anytime version of the k-NN algorithm is consistently improved for a large number of datasets.
  • Keywords
    learning (artificial intelligence); pattern classification; anytime nearest neighbor classification; data stream learning algorithms; k-NN anytime classifier; k-nearest neighbor anytime classifier; ordered data record sequence; training set ranking; very high fluctuating data rates; Accuracy; Approximation algorithms; Artificial neural networks; Image recognition; Indexes; Sorting; Training; Anytime Algorithm; Classification; Data Stream; Nearest Neighbor;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2014 13th International Conference on
  • Conference_Location
    Detroit, MI
  • Type

    conf

  • DOI
    10.1109/ICMLA.2014.26
  • Filename
    7033103