• DocumentCode
    2448274
  • Title

    Fast instance selection hybrid algorithm adapted to large data sets

  • Author

    Ros, Frédéric ; Harba, Rachid

  • Author_Institution
    Inst. Prisme, Orleans Univ., Orleans, France
  • fYear
    2011
  • fDate
    14-16 Oct. 2011
  • Firstpage
    117
  • Lastpage
    122
  • Abstract
    This paper investigates a new hybrid algorithm for instance selection adapted to large databases. The key idea is to apply condensation algorithms to only small sets and useful patterns to reduce computation cost. The initial population is divided into “meta strata” resulting from the union of strata randomly generated. Interesting patterns are resulting from a reference “meta stratum” and are partitioned in clusters. For each “meta stratum” and each cluster, influencing patterns are selected on the basis of a 1-nn procedure. The sets of instances determined from all “meta strata” provide the final set. Experiments performed with various data sets are revealing the effectiveness and adequacy of the proposed approach.
  • Keywords
    data mining; database management systems; pattern clustering; 1-nn procedure; computation cost reduction; condensation algorithm; instance selection hybrid algorithm; large database; meta strata; meta stratum; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Databases; Prototypes; Training; clustering algorithm; instance selection; k-nearest neighbors; supervised classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Pattern Recognition (SoCPaR), 2011 International Conference of
  • Conference_Location
    Dalian
  • Print_ISBN
    978-1-4577-1195-4
  • Type

    conf

  • DOI
    10.1109/SoCPaR.2011.6089125
  • Filename
    6089125