DocumentCode :
2513869
Title :
Biomedical Relationship Extraction from Literature Based on Bio-semantic Token Subsequences
Author :
Katukuri, Jayasimha R. ; Xie, Ying ; Raghavan, Vijay V.
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
366
Lastpage :
370
Abstract :
Relationship extraction (RE) from biomedical literature is an important and challenging problem in both text mining and bioinformatics. Although various approaches have been proposed to extract protein-protein interaction types, their accuracy rates leave a large room for further exploration of more effective methods. In this paper, two supervised learning algorithms based on newly-defined ldquobio-semantic token subsequencerdquo are proposed for multi-class biomedical relationship extraction. The first approach calculates a ldquobio-semantic token subsequence kernelrdquo, while the second one explicitly extracts weighted features from bio-semantic token subsequences. The proposed structure called ldquobio-semantic token subsequencerdquo is able to capture semantic features from natural language sentences for biomedical RE. Two supervised learning algorithms based on the proposed structure outperform the state-of-the-art biomedical RE methods on multi-class protein-protein interaction extraction.
Keywords :
bioinformatics; data mining; proteins; bio-semantic token subsequence kernel; bio-semantic token subsequences; bioinformatics; biomedical literature; multiclass biomedical relationship extraction; multiclass protein-protein interaction extraction; natural language sentence; protein-protein interaction types; supervised learning algorithm; text mining; Bioinformatics; Biomedical computing; Data mining; Feature extraction; Kernel; Machine learning algorithms; Natural languages; Proteins; Supervised learning; Text mining; Biomedical Literature; NLP; Relationship Extraction; SVM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-0-7695-3885-3
Type :
conf
DOI :
10.1109/BIBM.2009.74
Filename :
5341759
Link To Document :
بازگشت