DocumentCode :
2035378
Title :
Hash based biclustering for class discovery from gene expression data: A pattern similarity approach
Author :
Mishra, Debahuti ; Shaw, Kailash ; Mishra, Sashikala ; Rath, Amiya Kumar ; Acharya, Milu
Author_Institution :
Inst. of Tech. Educ. & Res., Siksha O Anusandhan Univ., Bhubaneswar, India
Volume :
2
fYear :
2011
fDate :
8-10 April 2011
Firstpage :
137
Lastpage :
141
Abstract :
Cellular processes lead subsets of genes to be co-expressed and correlated under certain experimental conditions while behaving almost independently under other conditions. Discovering such local expression patterns may therefore uncover genetic mechanisms that lead to class discovery. In this paper, we propose an efficient hash-based biclustering approach that identifies coherent patterns, known as scaling and shifting patterns, in high-dimensional gene expression datasets. The proposed algorithm consists of two steps: first, we pre-process the data set with Principal Component Analysis (PCA), reducing the attributes without much loss of information; second, an enhanced pCluster algorithm, which uses a hashing technique to speed up the search, discovers the scaling and shifting patterns. These patterns yield pattern-based biclusters, which in turn contribute to class discovery. Finally, we compare our method with some existing pattern-based models and find that our algorithm is versatile and promising.
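The abstract does not give the details of the enhanced pCluster algorithm or its hashing scheme, so the following is only a minimal sketch of the underlying idea, not the authors' implementation. It uses the standard pScore from the pCluster model for shifting patterns, treats scaling patterns by taking logarithms first, and adds a hypothetical bucketing function (`bucket_genes`, an assumption made for illustration) that hashes genes by their quantized difference on one pair of conditions. The PCA pre-processing step is omitted, and the toy matrix is invented.

```python
from collections import defaultdict
from itertools import combinations
import math


def p_score(x_a, x_b, y_a, y_b):
    """pScore of the 2x2 submatrix formed by genes x, y and conditions a, b."""
    return abs((x_a - x_b) - (y_a - y_b))


def is_delta_pcluster(data, genes, conds, delta):
    """True if every 2x2 submatrix of the selected genes/conditions has pScore <= delta."""
    for x, y in combinations(genes, 2):
        for a, b in combinations(conds, 2):
            if p_score(data[x][a], data[x][b], data[y][a], data[y][b]) > delta:
                return False
    return True


def bucket_genes(data, a, b, delta):
    """Hypothetical hashing step (an assumption, not taken from the paper):
    group genes by floor((d[a] - d[b]) / delta). Genes sharing a bucket have
    pScore < delta on conditions (a, b)."""
    buckets = defaultdict(list)
    for g, row in enumerate(data):
        buckets[math.floor((row[a] - row[b]) / delta)].append(g)
    return dict(buckets)


# Toy expression matrix (rows = genes, columns = conditions); values are invented.
data = [
    [1.0, 3.0, 2.0],
    [4.0, 6.0, 5.0],  # gene 0 shifted by +3
    [2.0, 6.0, 4.0],  # gene 0 scaled by 2
    [5.0, 1.0, 9.0],  # unrelated profile
]

# A scaling pattern becomes a shifting pattern after a log transform.
log_data = [[math.log(v) for v in row] for row in data]

shift_ok = is_delta_pcluster(data, [0, 1], [0, 1, 2], 0.5)
scale_raw = is_delta_pcluster(data, [0, 2], [0, 1, 2], 0.5)
scale_log = is_delta_pcluster(log_data, [0, 2], [0, 1, 2], 0.5)
buckets = bucket_genes(data, 0, 1, 1.0)
```

In this sketch, bucketing only acts as a candidate filter: genes that fall into adjacent buckets can still satisfy the threshold, so a complete search would also have to compare neighbouring buckets.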
Keywords :
biology computing; cellular biophysics; file organisation; genetics; pattern clustering; principal component analysis; PCA; cellular process; class discovery; data set preprocess; genetic mechanism; hash based biclustering; hashing technique; high dimensional gene expression dataset; pCluster algorithm; pattern based model; pattern similarity; principal component analysis; scaling pattern; shifting pattern; Algorithm design and analysis; Biological system modeling; Clustering algorithms; Data models; Gene expression; Indexes; Principal component analysis; Biclustering; Class Discovery; Gene Expression profiling; Hashing; Principal Component Analysis; Scaling Pattern; Shifting Pattern
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Electronics Computer Technology (ICECT), 2011 3rd International Conference on
Conference_Location :
Kanyakumari
Print_ISBN :
978-1-4244-8678-6
Electronic_ISBN :
978-1-4244-8679-3
Type :
conf
DOI :
10.1109/ICECTECH.2011.5941671
Filename :
5941671