DocumentCode :
1798174
Title :
Detection of non-structural outliers for microarray experiments
Author :
ZiHua Yang ; Zheng Rong Yang
Author_Institution :
Sch. of Biosci., Univ. of Exeter, Exeter, UK
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
1600
Lastpage :
1605
Abstract :
Outliers are unavoidable in many experiments due to various complex reasons ranging from equipment resolution to data contamination. The presence of outliers in microarray gene expression data can affect the quality of gene selection and ranking. This effect is severe when a microarray gene expression data is composed of too few samples. We classify outliers occurred in microarray gene expression data as structural and non-structural outliers. Structural outliers are gene dependent or sample dependent (or both) whereas non-structural outliers are gene and sample-independent. They are uninformative to gene expression differentiation but can cause misclassification of a differentially expressed gene as a non-differentially expressed one. While there are algorithms for detecting structural outliers, a different strategy is required for detecting non-structural outliers. We show the impact of non-structural outliers on gene selection/ranking and false discovery rate control. We also show the unsuitableness of existing outlier detection algorithms for detecting non-structural outliers. We propose a new algorithm for detecting non-structural outliers. It models the consecutive differences of ordered gene expressions as exponentially distributed. We use simulated and real data to demonstrate the efficacy of the proposed algorithm in correcting for non-structural outliers and improving gene selection/ranking and false discovery rate control.
Keywords :
biology computing; data handling; genetics; data contamination; equipment resolution; false discovery rate control; gene expression differentiation; gene ranking; gene selection; microarray experiments; microarray gene expression data; nonstructural outlier detection; structural outliers; Accuracy; Algorithm design and analysis; Gene expression; Prediction algorithms; Prostate cancer; Sensitivity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889818
Filename :
6889818
Link To Document :
بازگشت