• DocumentCode
    1784912
  • Title

    A general instance representation architecture for protein-protein interaction extraction

  • Author

    Lishuang Li ; Zhenchao Jiang ; Degen Huang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Dalian Univ. of Technol., Dalian, China
  • fYear
    2014
  • fDate
    2-5 Nov. 2014
  • Firstpage
    497
  • Lastpage
    500
  • Abstract
    Previous researches have shown that supervised Protein-Protein Interaction Extraction (PPIE) can get high accuracies with elaborately selected features and kernels. However, most features and kernels rest upon domain knowledge and natural language analysis, which makes the supervised model expensive, heavy and brittle. Moreover, the one-hot encoding, a commonly used representation technique, fails to capture the semantic similarity between words. To reduce the manual labor and overcome the shortage of one-hot encoding, we put forward a general instance representation architecture for PPIE, which integrates word representation and vector composition. Our method obtains F-scores of 69.4%, 78.8%, 76.0%, 74.0% and 81.1% on AIMed, BioInfer, HPRD50, IEPA and LLL respectively.
  • Keywords
    bioinformatics; feature selection; learning (artificial intelligence); natural languages; proteins; proteomics; semantic networks; domain knowledge analysis; feature selection; general instance representation architecture; kernel selection; natural language analysis; semantic similarity; supervised protein-protein interaction extraction; vector composition integration; word representation integration; Encoding; Feature extraction; Kernel; Protein engineering; Proteins; Skeleton; Vectors; Protein-Protein Interaction; instance representation; relation extraction; word representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
  • Conference_Location
    Belfast
  • Type

    conf

  • DOI
    10.1109/BIBM.2014.6999208
  • Filename
    6999208