Title :
An Entropy-based Statistical Workflow Provides Noise-Minimizing Biological Annotation for
Author :
Aging, Muscular ; Koutsandreas, Theodoros ; Valavanis, Ioannis ; Pilalis, Eleftherios ; Chatziioannou, Aristotelis
Author_Institution :
Inst. of Biol., Medicinal Chem. & Biotechnol., Nat. Hellenic Res. Found. (NHRF), Athens, Greece
Abstract :
This study aims to expand the efficiency of the interpretation concerning the aging process, by exploring a broad gene set, derived from the analysis of an integrative transcriptomic microarray dataset. The dataset comprises human skeletal muscle samples, obtained from healthy males and females, that were used to derive a gene signature of a high informative content, with respect to its functional association with the aging phenotype. Towards this end, a multilayered computational workflow integrating advanced statistical methodologies for the derivation of reliable confidence measures, distribution-based entropy calculations to examine the informational content of the dataset, enrichment analysis, graph-theoretic methods and intuitive visualization was applied. Specifically, statistical testing revealed differentially expressed genes, while an uncertainty calculation algorithm, exploiting Gene Ontology (GO) terms annotations, extended the list of significant genes from 254 to 2791, namely p-value threshold was increased from 0.0005 to 0.103, while keeping simultaneously noise measurements legitimately low. This rich gene set associated functionally the macroscopic phenotype of muscular aging with highly informative, stably correlated with each other, molecular annotations in the GO database. Finally, a set of 57 reliable genes was identified that comprise a gender-independent aging signature, after incorporating crucial information about genes pivotal regulatory role as inferred by the GO tree. The biological interpretation was highly assisted by the illustration of the functional mappings between genes, cellular location and biological processes through circle packing graphs.
Keywords :
associative processing; biochemistry; bioinformatics; bone; correlation methods; data analysis; data visualisation; demography; entropy; genetics; genomics; geriatrics; graph theory; lab-on-a-chip; medical computing; minimisation; molecular biophysics; muscle; noise; ontologies (artificial intelligence); statistical analysis; GO database; GO term annotation; GO tree; biological interpretation; biological process; cellular location; circle packing graph; confidence measure derivation; differential gene expression; distribution-based entropy calculation; enrichment analysis; entropy-based statistical workflow; gender-independent aging signature; gene functional association; gene functional mapping; gene identification; gene ontology term annotation; gene regulatory role; gene set; gene signature; graph theoretic method; human skeletal muscle; integrative transcriptomic microarray dataset analysis; intuitive visualization; macroscopic phenotype; molecular annotation; multilayered computational workflow; muscular aging phenotype; noise-minimizing biological annotation; p-value threshold; simultaneously noise measurement; stable correlation; statistical testing; uncertainty calculation algorithm; Aging; Entropy; Muscles; Noise measurement; Probes; Uncertainty; aging; enrichment analysis; entropy; functional annotation; gene ontology; genes; muscle; visualization;
Conference_Titel :
Systems Biology (ISB), 2014 8th International Conference on
Conference_Location :
Qingdao
DOI :
10.1109/ISB.2014.6990749