DocumentCode :
139698
Title :
Cloud-scale genomic signals processing classification analysis for gene expression microarray data
Author :
Harvey, Benjamin ; Soo-Yeon Ji
Author_Institution :
Bowie State Univ., Bowie, MD, USA
fYear :
2014
fDate :
26-30 Aug. 2014
Firstpage :
1843
Lastpage :
1846
Abstract :
As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
Keywords :
DNA; RNA; cloud computing; data analysis; genetics; genomics; inference mechanisms; medical signal processing; signal classification; signal denoising; wavelet transforms; Cloud computing; DNA sequence data analysis; GCM; biological inference; biological processes; cloud-based distributed processing environment; cloud-scale genomic signals processing classification analysis; functional genomics; gene expression microarray data; gene pattern analysis; global cancer map; mRNA sequence data analysis; wavelet based denoising; wavelet classification; wavelet preprocessing; wavelet thresholding; Bioinformatics; Gene expression; Genomics; Signal processing; Tumors; Wavelet analysis; Wavelet transforms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE
Conference_Location :
Chicago, IL
ISSN :
1557-170X
Type :
conf
DOI :
10.1109/EMBC.2014.6943968
Filename :
6943968
Link To Document :
بازگشت