• DocumentCode
    3770372
  • Title

    Training set reduction using Geometric Median

  • Author

    Chatchai Kasemtaweechok;Worasait Suwannik

  • Author_Institution
    Department of Computer Science, Kasetsart University, Bangkok, Thailand
  • fYear
    2015
  • Firstpage
    153
  • Lastpage
    156
  • Abstract
    Learning large-scale dataset takes excessive processing time. Hence, smaller size of training set is beneficial to reduce the learning load. In this paper, a set of Geometric Medians are used as representative instances of the whole training set. Our proposed method can reduce the size of training sets to 0.015% - 10.81% of the original training set while the performance difference (F-Measure) is not over 6% from baseline models. In addition, this method has provided 1.91x to 7.01x speedup in learning time over the baseline models.
  • Keywords
    "Training","Simulated annealing","Skin","Diabetes","Iris","Data models","Information and communication technology"
  • Publisher
    ieee
  • Conference_Titel
    Communications and Information Technologies (ISCIT), 2015 15th International Symposium on
  • Type

    conf

  • DOI
    10.1109/ISCIT.2015.7458330
  • Filename
    7458330