DocumentCode
2947576
Title
Extraction of protein interaction information from unstructured text using a link grammar parser
Author
Seoud, Rania A Abul ; Youssef, Abou-Bakr M. ; Kadah, Yasser M.
Author_Institution
Fayoum Univ., Fayoum
fYear
2007
fDate
27-29 Nov. 2007
Firstpage
70
Lastpage
75
Abstract
As research continues to generate vast amounts of data pertaining to protein interactions, there is a critical need to capture these results in structured formats that permit computational analysis. Automating the extraction of interactions from unstructured text would improve the content of the databases that store this information and provide a method for managing the continued growth of newly published literature. Many algorithms have been reported for extracting biochemical interactions from biomedical text, using natural language processing approaches at various levels of complexity. Some algorithms use simple template matching; others exploit sophisticated parsing techniques. In this paper, we present an automated NLP-based information extraction system to identify protein interactions in biomedical text. Link grammar parsing can handle many syntactic structures and is relatively efficient computationally. Customizing the parser for the biomedical domain is expected to improve its performance further. Our approach first tags biological entities with the help of biomedical and linguistic protein name databases. The system then extracts complete interactions by analyzing the matching contents of syntactic roles and their linguistically significant combinations.
Keywords
biochemistry; computational linguistics; database management systems; grammars; information retrieval; medical computing; medical information systems; natural language processing; pattern matching; proteins; text analysis; automated natural language processing; biochemical interaction extraction; biomedical database; linguistic protein names databases; link grammar parser; protein interaction information extraction system; template matching; unstructured biomedical text; Bioinformatics; Biology computing; Biomedical computing; Biomedical engineering; Data engineering; Data mining; Databases; Humans; Natural language processing; Protein engineering; Bioinformatics; Information Extraction; Link Grammar; Natural Language Processing; Protein-Protein Interactions;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Engineering & Systems, 2007. ICCES '07. International Conference on
Conference_Location
Cairo
Print_ISBN
978-1-4244-1365-2
Electronic_ISBN
978-1-1244-1366-9
Type
conf
DOI
10.1109/ICCES.2007.4447028
Filename
4447028