DocumentCode :
3106635
Title :
A novel interpolation based missing value estimation method to predict missing values in microarray gene expression data
Author :
Bose, Sayan ; Das, Carl ; Dutta, Suparna ; Chattopadhyay, Subrata
Author_Institution :
Dept. of CSE & Eng., NSEC, Kolkata, India
fYear :
2012
fDate :
28-29 Dec. 2012
Firstpage :
318
Lastpage :
321
Abstract :
Microarray experiments can generate data sets with multiple missing expression values, normally due to various experimental problems. Unfortunately, many algorithms for gene expression analysis require a complete matrix of gene array values as input. Thereore, effective missing value estimation methods are essential to minimize the effect of incomplete data sets on analysis, and to increase the range of data sets to which these algorithms can be applied. In this regard, a new interpolation based imputation method is proposed to predict missing values in microarray gene expression data. The proposed method selects a subset of similar genes and a subset of similar samples with respect to each missing position and then applies interpolation in a novel manner to predict that missing value. The performance of the proposed method is studied based on the normalized root mean square error with existing estimation techniques including K-nearest neighbor (KNN), Sequential K-nearest neighbor (SKNN) and Iterative K-nearest neighbor (IKNN). The effectiveness of the proposed method, along with a comparison with existing methods, is demonstrated on different microarray data sets.
Keywords :
biology computing; genetic algorithms; genetics; genomics; interpolation; iterative methods; mean square error methods; sequential estimation; gene expression analysis algorithms; interpolation based missing value estimation method; iterative K-nearest neighbor; microarray gene expression data; multiple missing expression values; normalized root mean square error; sequential K-nearest neighbor; Bioinformatics; DNA; Estimation; Gene expression; Interpolation; Prediction algorithms; Vectors; interpolation; microarray; missing value estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Devices and Intelligent Systems (CODIS), 2012 International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4673-4699-3
Type :
conf
DOI :
10.1109/CODIS.2012.6422202
Filename :
6422202
Link To Document :
بازگشت