DocumentCode :
674927
Title :
A Modified Local Least Squares-Based Missing Value Estimation Method in Microarray Gene Expression Data
Author :
Bose, Sayan ; Das, Carl ; Gangopadhyay, Tamaghna ; Chattopadhyay, Subrata
Author_Institution :
Dept. of Comput. Sc. & Eng, Netaji Subhash Eng. Collgee, Kolkata, India
fYear :
2013
fDate :
15-17 Dec. 2013
Firstpage :
18
Lastpage :
23
Abstract :
Micro array gene expression data often contains missing values normally due to various experimental reasons. However, most of the gene expression data analysis algorithms, such as clustering, classification and network design, require a complete matrix of gene array during analysis. It is therefore very important to accurately impute the missing values before applying the data analysis algorithms. In this paper, a modified Local Least Square imputation based algorithm known as NSLLSimpute has been introduced which overcomes the drawbacks of previously developed LLSimpute and SLLSimpute algorithms. The performance of NSLLSimpute algorithm is compared with the most commonly used imputation methods like K-nearest neighbor imputation (KNNimpute), Sequential K-nearest neighbor imputation (SKNNimpute), Iterative K-nearest neighbor imputation (IKNNimpute), Singular Value Decomposition (SVDimpute), Local Least Squares imputation (LLSimpute) and Sequential Local Least Squares imputation (SLLSimpute) in terms estimation accuracy using Root Mean Square error when applied on four publicly available micro array data sets over different rates of randomly introduced missing entries.
Keywords :
biology computing; data analysis; least squares approximations; singular value decomposition; IKNNimpute algorithm; KNNimpute algorithm; NSLLSimpute algorithm; SKNNimpute algorithm; SLLSimpute algorithms; SVDimpute algorithm; iterative k-nearest neighbor imputation; k-nearest neighbor imputation; micro array gene expression data analysis; modified local least square imputation based algorithm; modified local least squares-based missing value estimation method; root mean square error; sequential k-nearest neighbor imputation; sequential local least squares imputation; singular value decomposition; terms estimation accuracy; Accuracy; Algorithm design and analysis; Estimation; Euclidean distance; Gene expression; Least squares approximations; Vectors; Least squares; Microarray Gene expression data; Missing value estimation; RMS error;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computing, Networking and Security (ADCONS), 2013 2nd International Conference on
Conference_Location :
Mangalore
Type :
conf
DOI :
10.1109/ADCONS.2013.11
Filename :
6714131
Link To Document :
بازگشت