• DocumentCode
    2007782
  • Title

    Inferring Sparse Kernel Combinations and Relevance Vectors: An Application to Subcellular Localization of Proteins

  • Author

    Damoulas, Theodoros ; Ying, Yiming ; Girolami, M.A. ; Campbell, Colin

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Glasgow, Glasgow, UK
  • fYear
    2008
  • fDate
    11-13 Dec. 2008
  • Firstpage
    577
  • Lastpage
    582
  • Abstract
    In this paper, we introduce two new formulations for multi-class multi-kernel relevance vector machines (m-RVMs) that explicitly lead to sparse solutions, both in samples and in number of kernels. This enables their application to large-scale multi-feature multinomial classification problems where there is an abundance of training samples, classes and feature spaces. The proposed methods are based on an expectation-maximization (EM) framework employing a multinomial probit likelihood and explicit pruning of non-relevant training samples. We demonstrate the methods on a low-dimensional artificial dataset. We then demonstrate the accuracy and sparsity of the method when applied to the challenging bioinformatics task of predicting protein subcellular localization.
  • Keywords
    bioinformatics; cellular biophysics; expectation-maximisation algorithm; pattern classification; proteins; bioinformatics; expectation-maximization framework; large-scale multifeature multinomial classification problem; multiclass multikernel relevance vector machines; multinomial probit likelihood; proteins subcellular localization; relevance vectors; sparse kernel combinations; Bayesian methods; Bioinformatics; Extraterrestrial measurements; Kernel; Large-scale systems; Machine learning; Mathematics; Optimization methods; Protein engineering; Support vector machines; Kernel combination; Protein subcellular localization; Relevance vector machine; Sparsity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-0-7695-3495-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2008.124
  • Filename
    4725032