• DocumentCode
    667343
  • Title

    Frequent weighted itemset mining from gene expression data

  • Author

    Baralis, Elena ; Cagliero, Luca ; Cerquitelli, Tania ; Chiusano, Silvia ; Garza, Paolo

  • Author_Institution
    Dipt. di Autom. e Inf., Politec. di Torino, Turin, Italy
  • fYear
    2013
  • fDate
    10-13 Nov. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Gene Expression Datasets (GEDs) usually consist of the expression values of thousands of genes within hundreds of samples. Frequent itemset and association rule mining algorithms have been applied to discover significant co-expressions among multiple genes from GEDs. To perform these data analyses, gene expression values are commonly discretized into a predefined number of bins. Such an expert-driven and not trivial preprocessing step could bias the quality of the mining result. This paper presents a novel approach to discovering gene correlations from GEDs which does not require data discretization. By representing per-sample gene expression values as item weights, frequent weighted itemsets can be extracted. The discovery of weighted itemsets instead of traditional (not weighted) ones prevents experts from discretizing GEDs before analyzing them and thus improves the effectiveness of the knowledge discovery process. Experiments performed on real GEDs demonstrate the effectiveness of the proposed approach.
  • Keywords
    bioinformatics; data analysis; data mining; GEDs; association rule mining algorithms; data analysis; frequent weighted itemset mining; gene correlation discovery; gene expression datasets; knowledge discovery process; per-sample gene expression values; weighted itemset discovery; Association rules; Bioinformatics; Context; Gene expression; Itemsets;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
  • Conference_Location
    Chania
  • Type

    conf

  • DOI
    10.1109/BIBE.2013.6701681
  • Filename
    6701681