Title :
Literature mining associations of diseases using gene ontology
Author :
Li Yang ; Yanhong Zhou ; Zhuo Tang
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Abstract :
Upon the coming of post-genomics era, the research on gene function becomes especially important. Updating of the curated databases is lack of timely. There is a lot of knowledge hidden in biomedicine literatures. With the ever increasing amount of biomedicine literatures, mining the relations automatically is very urgent. Modern research has found most diseases are genetic. The relations between diseases and gene functions are waiting to be mining. This paper proposes a system to mine relations between diseases with common gene functions in the literature with normalized Metathesaurus. First, a two-phase Conditional Random Fields (CRFs) is used to recognize the disease mentions and gene function mentions, including the location and identification. Then, the paper adapts the Disease Ontology (DO) and the Gene Ontology (GO) to annotate the diseases and gene functions recognized for normalization by computing the similarity between mentions and concepts. According to the similarities, the mentions are denoted as concepts and instances distinctively. Thirdly, the paper builds a network and measure relations between diseases by computing similarities between common sub-graphs. The experiments carried out on a corpus randomly selected by GoPubMed with disease and the three domains in GO. The performance shows a lot of hidden relations between diseases and gives an explanation.
Keywords :
data mining; diseases; genetics; medical information systems; ontologies (artificial intelligence); CRF; DO; GO; GoPubMed; biomedicine literatures; conditional random fields; curated databases; disease mentions; disease ontology; gene function; gene function mentions; gene ontology; literature mining associations; metathesaurus; post-genomics era; Biology; Biomedical measurement; Diseases; Disease Ontology; Gene Ontology; disease; gene function; sub-graph; text mining;
Conference_Titel :
Computer Science & Education (ICCSE), 2013 8th International Conference on
Conference_Location :
Colombo
Print_ISBN :
978-1-4673-4464-7
DOI :
10.1109/ICCSE.2013.6554016