DocumentCode :
3256146
Title :
Evaluating Distributional Semantic and Feature Selection for Extracting Relationships from Biological Text
Author :
Emadzadeh, Ehsan ; Jonnalagadda, Siddhartha ; Gonzalez, Graciela
Author_Institution :
Dept. of Biomed. Inf., Arizona State Univ., Tempe, AZ, USA
Volume :
2
fYear :
2011
fDate :
18-21 Dec. 2011
Firstpage :
66
Lastpage :
71
Abstract :
The constant flow of biomolecular findings being published each day challenges our ability to develop methods to automatically extract the knowledge expressed in text to potentially influence new discoveries. Finding relations between the biological entities (e.g. proteins and genes) in text is a challenging task. To facilitate the extraction process, a relation can be decomposed into a trigger and the complementary arguments (e.g. theme, site). Several approaches have been proposed based on machine learning which generally use a common set of features for all trigger types. Here we evaluate the impact of applying a feature selection method for trigger classification. Our proposed method uses a greedy feature selection algorithm to find an optimal set of attributes for each trigger type. We show that using the customized set of features can improve classification results significantly (up to 53.96% in f-measure). In addition, we evaluated different settings for including semantic features in the classifiers. We found that using semantic features can improve classification results and found the best setting for each trigger type.
Keywords :
biology computing; greedy algorithms; learning (artificial intelligence); text analysis; biological entities; biological text; biomolecular findings; distributional semantic evaluation; extraction process; greedy feature selection algorithm; machine learning; relationship extraction; Feature extraction; Proteins; Semantics; Support vector machines; Testing; Training; Vectors; Distributional Semantic; Feature selection; NLP; Relation Extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4577-2134-2
Type :
conf
DOI :
10.1109/ICMLA.2011.65
Filename :
6147050
Link To Document :
بازگشت