DocumentCode :
2553140
Title :
On the gene team mining problem
Author :
Chen, Hao-Sen ; Lee, Guanling ; Peng, Sheng-Lung
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Dong Hwa Univ., Hualien, Taiwan
fYear :
2012
fDate :
29-31 May 2012
Firstpage :
1124
Lastpage :
1127
Abstract :
Let Σ be a set of n genes. A chromosome G can be represented as a permutation of Σ. A subset D of Σ is a δ-set of G if any two consecutive genes in G∩D are at distance at most δ. For a set G of m chromosomes, a set D is a δ-team of G if D is a δ-set of every chromosome of G. Given a gene set Σ, a chromosome set G, and an integer δ, the gene team finding problem is to find all possible δ-teams of G. Given a gene set Σ, a chromosome set G of m chromosomes, an integer k ≤ m, and an integer δ, the gene team mining problem is to find all possible δ-teams of every chromosome set G´ such that G´ ⊆ G and |G´| ≥ k. In this paper, we study the gene team mining problem. The Apriori technique is widely used in data mining. However, the gene team mining problem lacks the Apriori property, namely that every nonempty subset of a δ-team (δ-set) must also be a δ-team (δ-set). Thus, many techniques used in data mining cannot be applied to the gene team mining problem. We propose the concept of pseudo-support. Using this concept, an Apriori-like algorithm can be obtained to solve the gene team mining problem.
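The δ-set and δ-team definitions above can be illustrated with a minimal Python sketch. It is not the paper's algorithm and does not implement pseudo-support. It assumes that the distance between two genes is the difference of their positions in the permutation. The function names `is_delta_set` and `is_delta_team` and the two example chromosomes are hypothetical, chosen only for illustration.

```python
from typing import Iterable, Sequence, Set


def is_delta_set(chromosome: Sequence[str], genes: Set[str], delta: int) -> bool:
    """Check whether `genes` is a delta-set of one chromosome.

    Assumption: distance = difference of positions in the permutation.
    """
    # Positions of the selected genes, already in increasing order.
    positions = [i for i, g in enumerate(chromosome) if g in genes]
    # Every pair of consecutive selected genes must be at most delta apart.
    return all(b - a <= delta for a, b in zip(positions, positions[1:]))


def is_delta_team(chromosomes: Iterable[Sequence[str]], genes: Set[str], delta: int) -> bool:
    """Check whether `genes` is a delta-set of every chromosome in the set."""
    return all(is_delta_set(c, genes, delta) for c in chromosomes)


# Hypothetical example: two chromosomes over the gene set {a, b, c, d, e}.
chromosomes = [list("abcde"), list("cabed")]

# {a, b, c} is a 1-team: its genes are adjacent in both chromosomes.
print(is_delta_team(chromosomes, {"a", "b", "c"}, 1))

# The subset {a, c} is not a 1-team: in "abcde", a and c are 2 apart.
print(is_delta_team(chromosomes, {"a", "c"}, 1))
```

In this example {a, b, c} is a 1-team while its subset {a, c} is not, which is a concrete instance of the missing Apriori property described in the abstract: removing a gene can widen the gap between the remaining consecutive genes.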
Keywords :
bioinformatics; cellular biophysics; data mining; genetics; set theory; δ-set; δ-team; Apriori algorithm; Apriori property; Apriori technique; chromosome representation; chromosome set; data mining; gene team finding problem; gene team mining problem; genes set; permutation representation; pseudosupport concept; Bioinformatics; Biological cells; Clustering algorithms; Computational biology; Data mining; Databases; Genomics;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
Type :
conf
DOI :
10.1109/FSKD.2012.6234337
Filename :
6234337