• DocumentCode
    667241
  • Title

    A kernel SVM algorithm to detect mislabeled microarrays in human cancer samples

  • Author

    Martin-Merino, Manuel

  • Author_Institution
    Comput. Sci. Dept., Univ. Pontificia of Salamanca, Salamanca, Spain
  • fYear
    2013
  • fDate
    10-13 Nov. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    DNA Microarrays have been successfully applied to the identification of different cancer types considering the gene expression profiles. However, previous studies have shown that labeling errors are not uncommon in microarray studies. In this case, the training set may contain mislabelled examples that may lead the classifier to poor performance. In this paper we propose a new filtering algorithm based on one-class SVM classification to detect mislabelled samples. To this aim, samples and labels are mapped together to feature space using the kernel of dissimilarities. Next, outliers are detected via one-class classification. Mislabeled samples and outliers in input space can be separated comparing the outliers obtained in input and feature spaces. The algorithm proposed has been tested using several complex cancer microarray datasets in which some samples are mislabelled according to the literature. The experimental results suggest that our algorithm is effective detecting labeling errors and compares favorably with a standard technique such as simple SVM.
  • Keywords
    DNA; cancer; feature extraction; genetics; information filtering; lab-on-a-chip; medical computing; pattern classification; support vector machines; DNA microarrays; cancer microarray datasets; cancer types identification; dissimilarities kernel; feature space; filtering algorithm; gene expression profiles; human cancer samples; kernel SVM algorithm; labeling errors; mislabeled microarrays detection; mislabelled samples detection; one-class SVM classification; outliers detection; Bioinformatics; Breast cancer; Colon; Kernel; Labeling; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
  • Conference_Location
    Chania
  • Type

    conf

  • DOI
    10.1109/BIBE.2013.6701579
  • Filename
    6701579