• DocumentCode
    2947576
  • Title

    Extraction of protein interaction information from unstructured text using a link grammar parser

  • Author

    Seoud, Rania A Abul ; Youssef, Abou-Bakr M. ; Kadah, Yasser M.

  • Author_Institution
    Fayoum Univ., Fayoum
  • fYear
    2007
  • fDate
    27-29 Nov. 2007
  • Firstpage
    70
  • Lastpage
    75
  • Abstract
    As research continues to generate vast amounts of data pertaining to protein interactions, there is a critical need to capture these results in structured formats that permit computational analysis. Automating the extraction of interactions from unstructured text would improve the content of databases that store this information and provide a method for managing the continued growth of newly published literature. Many algorithms have been reported for extracting biochemical interactions from biomedical text, using natural language processing approaches at various levels of complexity. Some algorithms use simple template matching; others exploit sophisticated parsing techniques. In this paper, we present an automated NLP-based information extraction system to identify protein interactions in biomedical text. The system uses a link grammar parser: link grammar parsing can handle many syntactic structures and is computationally relatively efficient. Customizing the parser for the biomedical domain is expected to improve its performance further. Our approach first tags biological entities with the help of biomedical and linguistic protein-name databases. The system then extracts complete interactions by analyzing the matching contents of syntactic roles and their linguistically significant combinations.
  • Keywords
    biochemistry; computational linguistics; database management systems; grammars; information retrieval; medical computing; medical information systems; natural language processing; pattern matching; proteins; text analysis; automated natural language processing; biochemical interaction extraction; biomedical database; linguistic protein names databases; link grammar parser; protein interaction information extraction system; template matching; unstructured biomedical text; Bioinformatics; Biology computing; Biomedical computing; Biomedical engineering; Data engineering; Data mining; Databases; Humans; Natural language processing; Protein engineering; Bioinformatics; Information Extraction; Link Grammar; Natural Language Processing; Protein-Protein Interactions;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Engineering & Systems, 2007. ICCES '07. International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-1365-2
  • Electronic_ISBN
    978-1-4244-1366-9
  • Type

    conf

  • DOI
    10.1109/ICCES.2007.4447028
  • Filename
    4447028