DocumentCode :
2947576
Title :
Extraction of protein interaction information from unstructured text using a link grammar parser
Author :
Seoud, Rania A Abul ; Youssef, Abou-Bakr M. ; Kadah, Yasser M.
Author_Institution :
Fayoum Univ., Fayoum
fYear :
2007
fDate :
27-29 Nov. 2007
Firstpage :
70
Lastpage :
75
Abstract :
As research continues to generate vast amounts of data pertaining to protein interactions, there is a critical need to capture these results in structured formats that permit computational analysis. Automating the extraction of interactions from unstructured text would improve the content of databases that store this information and provide a method for managing the continued growth of the published literature. Many algorithms have been reported for extracting biochemical interactions from biomedical text, using natural language processing approaches at various levels of complexity: some use simple template matching, while others exploit sophisticated parsing techniques. In this paper, we present an automated NLP-based information extraction system that identifies protein interactions in biomedical text. Link grammar parsing can handle many syntactic structures and is relatively efficient computationally. Customizing the parser for the biomedical domain is expected to improve its performance further. Our approach first tags biological entities with the help of biomedical and linguistic protein-name databases. The system then extracts complete interactions by analyzing the matching contents of syntactic roles and their linguistically significant combinations.
Keywords :
biochemistry; computational linguistics; database management systems; grammars; information retrieval; medical computing; medical information systems; natural language processing; pattern matching; proteins; text analysis; automated natural language processing; biochemical interaction extraction; biomedical database; linguistic protein names databases; link grammar parser; protein interaction information extraction system; template matching; unstructured biomedical text; Bioinformatics; Biology computing; Biomedical computing; Biomedical engineering; Data engineering; Data mining; Databases; Humans; Natural language processing; Protein engineering; Bioinformatics; Information Extraction; Link Grammar; Natural Language Processing; Protein-Protein Interactions;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Engineering & Systems, 2007. ICCES '07. International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-1365-2
Electronic_ISBN :
978-1-4244-1366-9
Type :
conf
DOI :
10.1109/ICCES.2007.4447028
Filename :
4447028