DocumentCode :
2513693
Title :
Applying Feature Coupling Generalization for Protein-Protein Interaction Extraction
Author :
Li, Yanpeng ; Lin, Hongfei ; Yang, Zhihao
Author_Institution :
Dept. of Comput. Sci. & Eng., Dalian Univ. of Technol., Dalian, China
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
396
Lastpage :
400
Abstract :
We present the application of a recently proposed semi-supervised learning strategy - feature coupling generalization (FCG) - in the task of protein-protein interaction extraction from biomedical literatures. FCG is a framework that generates new features from relatedness of two special types of old features: example-distinguishing features (EDFs) and class-distinguishing features (CDFs). Their relatedness estimated from unlabeled data tends to capture indicative information not available in labeled data. For this task, we designed several EDFs and CDFs derived from the text patterns surrounding the co-occurrence proteins, and combined the new features generated by FCG with Boolean lexical features. The experimental results on AIMED corpus show that the new features yield significant improvement over a strong baseline, and the combined method achieves state-of-the-art performance without using any syntactic information.
Keywords :
Boolean functions; molecular biophysics; proteins; Boolean lexical features; class-distinguishing features; example-distinguishing features; feature coupling generalization; protein-protein interaction extraction; semisupervised learning strategy; syntactic information; Application software; Bioinformatics; Biomedical engineering; Computer science; Data mining; Kernel; Machine learning; National electric code; Protein engineering; Semisupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-0-7695-3885-3
Type :
conf
DOI :
10.1109/BIBM.2009.65
Filename :
5341746
Link To Document :
بازگشت