Title of article :
A Novel Sample Selection Strategy for Imbalanced Data of Biomedical Event Extraction with Joint Scoring Mechanism
Author/Authors :
Lu, Yang Jilin University - Changchun - Jilin, China , Ma, Xiaolei Jilin University - Changchun - Jilin, China , Lu, Yinan Jilin University - Changchun - Jilin, China , Zhou, Yuxin Jilin University - Changchun - Jilin, China , Pei, Zhili Inner Mongolia University for Nationalities - Tongliao - Inner Mongolia, China
Abstract :
Biomedical event extraction is an important and difficult task in bioinformatics. With the rapid growth of biomedical literature, the
extraction of complex events from unstructured text has attracted more attention. However, the annotated biomedical corpus is
highly imbalanced, which affects the performance of the classification algorithms. In this study, a sample selection algorithm based
on sequential pattern is proposed to filter negative samples in the training phase. Considering the joint information between the
trigger and argument of multiargument events, we extract triplets of multiargument events directly using a support vector machine
classifier. A joint scoring mechanism, which is based on sentence similarity and importance of trigger in the training data, is used
to correct the predicted results. Experimental results indicate that the proposed method can extract events efficiently.
Keywords :
Data , Biomedical Event Extraction , BioNLP , BIND
Journal title :
Computational and Mathematical Methods in Medicine