• DocumentCode
    226700
  • Title

    Investigating distance metric learning in semi-supervised fuzzy c-means clustering

  • Author

    Lai, Daphne Teck Ching ; Garibaldi, Jonathan M. ; Reps, Jenna

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Nottingham, Nottingham, UK
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    1817
  • Lastpage
    1824
  • Abstract
    The idea behind distance metric learning (DML) is to accentuate the distance relations found in the training data, maintaining whether the data patterns are similar or dissimilar. In this paper, we investigate in using DML (GDML, LMNN, MCML and NCA) in semi-supervised Fuzzy c-means clustering and apply them on a real, biomedical dataset and on UCI datasets. We used a cross validation setting with varying amount of labelled data to test our methodology. Out of eight datasets, statistical significant improvement was found on five datasets using ssFCM with DML. This shows that DML can improve ssFCM clustering for some datasets. Further analysis using 2D PCA projection and sum of squared distances before and after DML transformation of the original data are carried out. Interestingly, DML was found to worsen ssFCM clustering in the NTBC dataset with hierarchical clusters.
  • Keywords
    fuzzy set theory; learning (artificial intelligence); pattern classification; pattern clustering; statistical analysis; 2D PCA projection; DML; GDML; LMNN; MCML; NCA; NTBC dataset; UCI datasets; cross-validation; dissimilar data patterns; distance metric learning; distance relations; hierarchical clusters; labelled data; real biomedical dataset; semisupervised fuzzy c-means clustering; similar data patterns; ssFCM clustering; statistical analysis; sum-of-squared distances; training data; Accuracy; Cardiography; Clustering algorithms; Measurement; Principal component analysis; Training; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ-IEEE), 2014 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-2073-0
  • Type

    conf

  • DOI
    10.1109/FUZZ-IEEE.2014.6891673
  • Filename
    6891673