• DocumentCode
    2515148
  • Title

    A Multi-task Feature Selection Filter for Microarray Classification

  • Author

    Lan, Liang ; Vucetic, Slobodan

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Temple Univ., Philadelphia, PA, USA
  • fYear
    2009
  • fDate
    1-4 Nov. 2009
  • Firstpage
    160
  • Lastpage
    165
  • Abstract
    A major challenge in microarray classification and biomarker discovery is dealing with small-sample high-dimensional data where the number of genes used as features is typically orders of magnitude larger than the number of labeled microarrays. One way to address this challenge is by leveraging information from the publicly accessible repositories of microarray data. Following this idea, a multi-task feature selection filter is proposed that borrows strength from the auxiliary microarray classification data sets. The filter uses Kruskal-Wallis test on auxiliary data sets and ranks genes based on their aggregated p-values. Expressions of the top-ranked genes are used as features to build a classifier on the target data set. The proposed approach was evaluated on 9 microarray data sets related to 9 different types of cancers. Comparison of the classification accuracies reveals that the multi-task feature selection is superior to single-task feature selection. Furthermore, the results strongly suggest that multi-task algorithms could improve microarray classification by exploiting auxiliary data during feature selection and learning.
  • Keywords
    bioinformatics; cancer; genetics; medical information systems; molecular biophysics; Kruskal-Wallis test; aggregated p-values; auxiliary microarray classification data sets; bioinformatics tool; cancers; microarray classification; multitask algorithms; multitask feature selection filter; top-ranked genes; Brain; Cancer; Classification algorithms; Extraterrestrial measurements; Filters; Logistics; Neoplasms; Testing; Training data; Vectors; feature selection; microarray classification; multi-task learning; transfer learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-0-7695-3885-3
  • Type

    conf

  • DOI
    10.1109/BIBM.2009.79
  • Filename
    5341826