Title :
Sampling bias in microarray data analysis: A demonstration in the field of reproductive biology
Author :
Manafi, Shirin ; Uyar, A. ; Bener, Ayse
Author_Institution :
Data Sci. Lab. Dept. of Mech. & Ind. Eng., Ryerson Univ., Toronto, ON, Canada
Abstract :
The actual benefit from high-throughput microarray experiments strongly relies on elimination of all possible sources of biases during both the experimental procedure and data analysis process. Within the context of reproductive biology, microarray based transcriptomic analysis of oocyte and surrounding cumulus/granulosa cells poses significant challenges due to limited amount of samples and/or potential contaminations from adjacent cells. In this study, we investigated the effect of sampling bias on consistency of the microarray differential expression analysis in the field of reproduction. Experiments were conducted on five datasets obtained from publicly available microarray repositories. For each dataset, probe level expression values were extracted and background adjustment, inter-array quantile normalization and probe set summarization were performed according to the Robust Multi-Chip Average algorithm. Genes with a false discovery rate-corrected p value of <;0.05 and [Fold Change] > 2 were considered as differentially expressed. Results demonstrate that both number of replicates and including different subsets of available samples in the analysis alter the number of differentially expressed genes. We suggest that assessment of inter-sample variance prior to differential expression analysis is an important step in microarray experiments and proper handling of that variance may require alternative normalization and/or statistical test methods.
Keywords :
cellular biophysics; data analysis; genetics; lab-on-a-chip; statistical testing; contaminations; cumulus cells; genes; granulosa cells; interarray quantile normalization; intersample variance; microarray based transcriptomic analysis; microarray data analysis; microarray differential expression analysis; oocyte; probe level expression; reproductive biology; robust multichip average algorithm; sampling bias; statistical test methods; Abstracts; Cells (biology); Context; Data analysis; Decision support systems; Probes; Experimental design; Microarray Data Analysis; Normalization; Sampling Bias;
Conference_Titel :
Health Informatics and Bioinformatics (HIBIT), 2013 8th International Symposium on
Conference_Location :
Ankara
Print_ISBN :
978-1-4799-0700-7
DOI :
10.1109/HIBIT.2013.6661684