Title :
Using contextual information to clarify Gene Normalization ambiguity
Author :
Lai, Po-Ting ; Bow, Yue-Yang ; Huang, Chi-Hsin ; Dai, Hong-Jie ; Tsai, Richard Tzong-Han ; Hsu, Wen-Lian
Author_Institution :
Dept. of Comput. Sci. & Eng., Yuan Ze Univ., Chungli, Taiwan
Abstract :
The goal of gene normalization (GN) is to identify the unique database identifiers of genes and proteins mentioned in biomedical literature. A major difficulty in GN comes from inter-species gene ambiguity. That is, the same gene name can refer to different database identifiers depending on the species in question. In this paper, we introduce a method to exploit contextual information in an abstract, like tissue type, chromosome location, etc., to tackle this problem. Using this technique, we have been able to improve system performance (F-score) by 14.3% on the BioCreAtIvE-II GN task test set.
Keywords :
biology computing; database management systems; genetics; BioCreAtIvE-II GN task test set; biomedical literature; contextual information; gene normalization ambiguity; inter-species gene ambiguity; unique database identifiers; Biological cells; Biomedical engineering; Computer science; Databases; Filters; Humans; Information science; Iron; Protein engineering; System performance;
Conference_Titel :
Information Reuse & Integration, 2009. IRI '09. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-4114-3
Electronic_ISBN :
978-1-4244-4116-7
DOI :
10.1109/IRI.2009.5211619