DocumentCode :
667343
Title :
Frequent weighted itemset mining from gene expression data
Author :
Baralis, Elena ; Cagliero, Luca ; Cerquitelli, Tania ; Chiusano, Silvia ; Garza, Paolo
Author_Institution :
Dipt. di Autom. e Inf., Politec. di Torino, Turin, Italy
fYear :
2013
fDate :
10-13 Nov. 2013
Firstpage :
1
Lastpage :
4
Abstract :
Gene Expression Datasets (GEDs) usually consist of the expression values of thousands of genes within hundreds of samples. Frequent itemset and association rule mining algorithms have been applied to discover significant co-expressions among multiple genes from GEDs. To perform these data analyses, gene expression values are commonly discretized into a predefined number of bins. Such an expert-driven and not trivial preprocessing step could bias the quality of the mining result. This paper presents a novel approach to discovering gene correlations from GEDs which does not require data discretization. By representing per-sample gene expression values as item weights, frequent weighted itemsets can be extracted. The discovery of weighted itemsets instead of traditional (not weighted) ones prevents experts from discretizing GEDs before analyzing them and thus improves the effectiveness of the knowledge discovery process. Experiments performed on real GEDs demonstrate the effectiveness of the proposed approach.
Keywords :
bioinformatics; data analysis; data mining; GEDs; association rule mining algorithms; data analysis; frequent weighted itemset mining; gene correlation discovery; gene expression datasets; knowledge discovery process; per-sample gene expression values; weighted itemset discovery; Association rules; Bioinformatics; Context; Gene expression; Itemsets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
Conference_Location :
Chania
Type :
conf
DOI :
10.1109/BIBE.2013.6701681
Filename :
6701681
Link To Document :
بازگشت