• DocumentCode
    3027415
  • Title

    Evaluating the Ambiguity of class structures via Instance Neighbor Entropy with weighting

  • Author

    Jing-Doo Wang ; Yao-Chug Shi

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Asia Univ., Taichung, Taiwan
  • fYear
    2010
  • fDate
    4-6 Aug. 2010
  • Firstpage
    43
  • Lastpage
    48
  • Abstract
    In this paper Instance Neighbor Entropy INE with weighting was proposed to estimate the Class Structure Ambiguity (CSA) of class structures. The main idea of the INE{x)k for one instance x was to compute the weighted entropy of class probability distribution of the top k nearest neighbors of that x. The weighting associated with that entropy was determined according to the inverse of the distance between the x and the other instances. One instance was seemed as ambiguous one if most of its neighbors came from the other classes. Therefore, one class structure might be ambiguous if it contained a lot of ambiguous instances. To evaluate the effectiveness of the CSA via INE, the Pearson´s correlation coefficient p between the values of accuracy achieved by SVM classifiers and the values of CSA was computed and expected to be close - 1 (perfect negative correlation) as possible. For experiments, there were two types of datasets. One was according to some seed points for each class and, for each seed point, there were a fixed number instances generated randomly under normal distribution while with class ambiguity under control. The other was selected from the LIBSVM as read world datasets. Experimental results showed that the evaluation of the CSA via INE(x)k did reveal the degree of class ambiguous with datasets generated randomly because the values of the p almost as -1, and the INE(x)k with weighted entropy evaluated more precisely than that without weighted entropy when with both types of datasets.
  • Keywords
    combinatorial mathematics; entropy; normal distribution; pattern classification; support vector machines; SVM classifier; ambiguous instance; class structure ambiguity; correlation coefficient; instance neighbor entropy; k nearest neighbor; normal distribution; probability distribution; weighted entropy; ambiguity; class structure; classification; entropy;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Frontier Computing. Theory, Technologies and Applications, 2010 IET International Conference on
  • Conference_Location
    Taichung
  • Type

    conf

  • DOI
    10.1049/cp.2010.0535
  • Filename
    5632279