Author_Institution :
Dept. of Comput., Univ. of Bradford, Bradford, UK
Abstract :
Triple negative breast cancers (TNBC) are clinically heterogeneous, an aggressive subtype with poor diagnosis and strong resistance to therapy. There is a need to identify novel robust biomarkers with high specificity for early detection and therapeutic intervention. Microarray gene expression-based studies have offered significant advances in molecular classification and identification of diagnostic/prognostic signatures, however sample scarcity and cohort heterogeneity remains area of concern. In this study, we performed integrated analysis on independent microarray big data studies and identified a robust 880-gene signature for TNBC diagnosis. We further identified 16-gene (OGN, ESR1, GPC3, LHFP, AGR3, LPAR1, LRRC17, TCEAL1, CIRBP, NTN4, TUBA1C, TMSB10, RPL27, RPS3A, RPS18, and NOSTRIN) that are associated to TNBC tissues. The 880-gene signature achieved excellent classification accuracy ratio on each independent expression data sets with overall average of 99.06%, is an indication of its diagnostic power. Gene ontology enrichment analysis of 880-gene signature shows that cell-cycle pathways/processes are important clinical targets for triple negative breast cancer. Further verification of 880-gene signature could provide additive knowledge for better understanding and future direction of triple negative breast cancer research.
Keywords :
Big Data; biology computing; cancer; genetics; medical diagnostic computing; molecular biophysics; ontologies (artificial intelligence); pattern classification; AGR3; CIRBP; ESR1; GPC3; LHFP; LPAR1; LRRC17; NOSTRIN; NTN4; OGN; RPL27; RPS18; RPS3A; TCEAL1; TMSB10; TNBC diagnosis; TNBC tissues; TUBA1C; cell-cycle pathways/process; classification accuracy ratio; diagnostic/prognostic signature; early detection; expression data set; gene ontology enrichment analysis; independent microarray big data study; microarray big data integrated analysis; microarray gene expression-based study; molecular classification; robust 880-gene signature; robust biomarker; robust diagnostic signature; therapeutic intervention; triple negative breast cancer; Accuracy; Big data; Breast cancer; Educational institutions; Gene expression; Robustness; Bioinformatics; Gene ontology enrichment analysis; Gene signature; Microarray; Triple negative breast cancer;