• DocumentCode
    1668058
  • Title

    Adaptive thresholding for multi-label SVM classification with application to protein subcellular localization prediction

  • Author

    Shibiao Wan ; Man-Wai Mak ; Sun-Yuan Kung

  • Author_Institution
    Dept. of Electron. & Inf. Eng., Hong Kong Polytech. Univ., Hung Hom, China
  • fYear
    2013
  • Firstpage
    3547
  • Lastpage
    3551
  • Abstract
    Multi-label classification has received increasing attention in computational proteomics, especially in protein subcellular localization. Many existing multi-label protein predictors suffer from over-prediction because they use a fixed decision threshold to determine the number of labels to which a query protein should be assigned. To address this problem, this paper proposes an adaptive thresholding scheme for multi-label support vector machine (SVM) classifiers. Specifically, each one-vs-rest SVM has an adaptive threshold that is a fraction of the maximum score of the one-vs-rest SVMs in the classifier. Therefore, the number of class labels of the query protein depends on the confidence of the SVMs in the classification. This scheme is integrated into our recently proposed subcellular localization predictor that uses the frequency of occurrences of gene-ontology terms as feature vectors and one-vs-rest SVMs as classifiers. Experimental results on two recent datasets suggest that the scheme can effectively avoid both over-prediction and under-prediction, resulting in performance significantly better than other gene-ontology based subcellular localization predictors.
  • Keywords
    biology computing; genetics; ontologies (artificial intelligence); proteins; support vector machines; adaptive thresholding; computational proteomics; feature vector; gene-ontology; multilabel SVM classification; multilabel protein predictor; one-vs-rest SVM; protein subcellular localization prediction; query protein; support vector machine; Accuracy; Databases; Ontologies; Proteins; Support vector machines; Training; Vectors; Adaptive thresholding; Gene Ontology; Multi-label SVM; Multi-label classification; Protein subcellular localization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638318
  • Filename
    6638318