Title :
A Study on Studies: Exploring the Metadata Associated with dbGaP Studies
Author :
Truong, Karen ; Conway, Mike
Author_Institution :
Dept. of Med., Univ. of California, San Diego, La Jolla, CA, USA
Abstract :
The database of Genotypes and Phenotypes (dbGaP) was developed by the National Heart Lung, and Blood Institute (NHLBI) to archive genome-wide association studies (GWAS) data. As of July 17th 2012, dbGaP contained 305 top-level studies. The metadata for each study (available from the dbGaP website) are organized into distinct sections, including a study description, inclusion/exclusion criteria, policies for authorized access requests, MeSH terms, PubMed identifiers, study histories, and the names of principal and co-investigators. We here tabulate the salient characteristics of dbGaP metadata as part of the Phenotype Discoverer (PhD) project, a research project at the University of California San Diego Division of Biomedical Informatics which aims to enhance the "searchability" of the current dbGaP website through the alignment of phenotypes to a standard information model. In particular, we are interested in using the extracted metadata PubMed identifiers, principal investigator names, associated journal names, etc. as input to a statistical text.
Keywords :
Web sites; authorisation; bioinformatics; genomics; information retrieval; meta data; statistical analysis; text analysis; GWAS data; MeSH terms; NHLBI; National Heart Lung and Blood Institute; PhD project; Phenotype Discoverer project; PubMed identifiers; University of California San Diego Division of Biomedical Informatics; authorized access request policy; database of genotypes and phenotypes; dbGaP Website; dbGaP studies; genome-wide association studies data; inclusion-exclusion criteria; metadata; research project; searchability enhancement; standard information model; statistical text; study description; study histories; Biomedical informatics; Blood; Databases; Diabetes; Educational institutions; Heart; Lungs;
Conference_Titel :
Healthcare Informatics, Imaging and Systems Biology (HISB), 2012 IEEE Second International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-4803-4
DOI :
10.1109/HISB.2012.51