DocumentCode
3770086
Title
Analysis of imputation algorithms for microarray gene expression data
Author
H L Shashirekha;Agaz Hussain Wani
Author_Institution
Department of Computer Science, Mangalore University, Mangalagangothri-574199 Mangalore, India
fYear
2015
Firstpage
589
Lastpage
593
Abstract
Microarray technology makes it possible to measure the expression levels of thousands of genes simultaneously in an efficient and inexpensive manner. However, because of various complexities in processing microarrays, expression values for some genes may be missing owing to unreliable measurements. Missing values in gene expression data can adversely affect downstream analyses such as clustering and dimensionality reduction. Different algorithms have been developed to estimate missing values in different data sets, but none of these algorithms works well on all data sets. In this work, we explore the application of the Mutual Nearest Neighbor (MNN) algorithm to impute missing values; it shows results comparable to those of other well-known imputation algorithms. We have also explored five different methods for missing value imputation, namely Row Average Imputation, Mean Imputation, Median Imputation, k-Nearest Neighbor (kNN) Imputation, and a combination of kNN-based feature selection (kNNFS) and kNN-based imputation. The experiments are carried out on very high-dimensional gene expression data, such as the Notterman Carcinoma and Notterman Adenocarcinoma data, and the results are illustrated.
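As a rough illustration of two of the methods named in the abstract, the sketch below shows row-average imputation and a basic kNN imputation. It is not the authors' implementation: this record does not give the paper's exact definitions, so the layout (genes as rows, samples as columns), the use of NaN for missing entries, the root-mean-square distance over shared columns, and the function names row_average_impute and knn_impute are all assumptions made for the example.

```python
import numpy as np


def row_average_impute(X):
    """Replace each NaN with the mean of the observed values in its row.

    Assumption: rows are genes and columns are samples.
    """
    X = np.asarray(X, dtype=float)
    filled = X.copy()
    row_means = np.nanmean(X, axis=1)          # per-row mean, ignoring NaN
    rows, cols = np.where(np.isnan(X))
    filled[rows, cols] = row_means[rows]
    return filled


def knn_impute(X, k=2):
    """Fill each NaN with the mean of that column over the k nearest rows.

    Distance (an assumption of this sketch) is the root-mean-square
    difference over the columns observed in both rows. Entries with no
    usable neighbour are left as NaN.
    """
    X = np.asarray(X, dtype=float)
    filled = X.copy()
    n_rows = X.shape[0]
    for i in np.unique(np.where(np.isnan(X))[0]):
        target = X[i]
        dists = np.full(n_rows, np.inf)
        for j in range(n_rows):
            if j == i:
                continue
            shared = ~np.isnan(target) & ~np.isnan(X[j])
            if shared.any():
                diff = target[shared] - X[j, shared]
                dists[j] = np.sqrt(np.mean(diff ** 2))
        order = np.argsort(dists)
        for c in np.where(np.isnan(target))[0]:
            # Keep only reachable neighbours that observe column c.
            neighbours = [j for j in order
                          if np.isfinite(dists[j]) and not np.isnan(X[j, c])][:k]
            if neighbours:
                filled[i, c] = np.mean(X[neighbours, c])
    return filled


# Toy expression matrix (hypothetical values): 4 genes x 3 samples.
expr = np.array([
    [1.0, 2.0, np.nan],
    [1.0, 2.0, 3.0],
    [1.1, 2.1, 3.2],
    [9.0, 9.0, 9.0],
])
row_avg = row_average_impute(expr)
knn = knn_impute(expr, k=2)
```

In this toy matrix, row averaging uses only the first gene's own observed values, whereas the kNN variant borrows the missing sample from the two genes with the most similar profiles.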
Keywords
"Gene expression","Multi-layer neural network","Algorithm design and analysis","Lungs","Clustering algorithms","Cancer"
Publisher
IEEE
Conference_Titel
Applied and Theoretical Computing and Communication Technology (iCATccT), 2015 International Conference on
Type
conf
DOI
10.1109/ICATCCT.2015.7456953
Filename
7456953