Title :
Text Mining for Hypotheses and Results in Translational Medicine Studies
Author :
Tsai, Terry H. ; Kasch, Niels ; Pfeifer, Craig ; Oates, Tim
Author_Institution :
Sch. of Med. Baltimore, Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
Most common and complex diseases, such as diabetes and cancer, are influenced at some level by variation in the genome. To truly address the goal of translational research, genetic variation must be taken into consideration. Research done in public health genetics, specifically in the area of single nucleotide polymorphisms (SNPs), is the first step to understanding human genetic variation. In addition, novel methods are needed to represent and to conduct text mining over textual genotypic data sources. In this paper, we describe the development and evaluation, in the context of a genetic study, of a translational-informatics method that supports both machine-learning text mining (e.g., Conditional random fields) and automated inference for identifying key concepts (e.g., Hypotheses and results). After scaling for inter-annotator agreement, our adjusted overall precision was 64%, with a range of 48% to 80%. While other biological text mining systems have focused on named-entity recognition, the development of tools for genetic studies focusing on hypotheses and results has been relatively rare.
Keywords :
data mining; diseases; genetics; genomics; learning (artificial intelligence); medical information systems; medicine; text analysis; SNP; biological text mining systems; cancer; complex diseases; conditional random fields; diabetes; genome; human genetic variation; interannotator agreement; machine-learning text mining; named-entity recognition; public health genetics; single nucleotide polymorphisms; textual genotypic data sources; translational medicine studies; translational-informatics method; Bioinformatics; Diabetes; Diseases; Genomics; Medical diagnostic imaging; Text mining; Biomedical informatics; Gene-environment interaction studies; Natural language processing; Text mining; Translational informatics;
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
DOI :
10.1109/ICDMW.2014.39