• DocumentCode
    1784879
  • Title

    Improving Kernel-based protein-protein interaction extraction by unsupervised word representation

  • Author

    Lishuang Li ; Rui Guo ; Zhenchao Jiang ; Degen Huang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Dalian Univ. of Technol., Dalian, China
  • fYear
    2014
  • fDate
    2-5 Nov. 2014
  • Firstpage
    379
  • Lastpage
    384
  • Abstract
    As an important branch of biomedical information extraction, Protein-Protein Interaction extraction (PPIe) from biomedical literatures has been widely researched, and machine learning methods have achieved great success for this task. However, the word feature generally adopted in the existing methods suffers badly from vocabulary gap and data sparseness, weakening the classification performance. In this paper, the unsupervised word representation approach is introduced to address these problems. Three word representation methods are adopted to improve the performance of PPIe: distributed representation, vector clustering and Brown clusters representation. Experimental results show that our method outperforms the state-of-the-art methods on five publicly available corpora.
  • Keywords
    biology computing; molecular biophysics; proteins; unsupervised learning; Brown clusters representation; biomedical information extraction; data sparseness; distributed representation; kernel-based protein-protein interaction extraction; machine learning methods; unsupervised word representation; vector clustering; vocabulary gap; Clustering algorithms; Context; Data mining; Feature extraction; Kernel; Proteins; Vectors; Brown clusters; Protein-Protein Interaction; distributed representation; word representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
  • Conference_Location
    Belfast
  • Type

    conf

  • DOI
    10.1109/BIBM.2014.6999188
  • Filename
    6999188