DocumentCode :
573580
Title :
Feature engineering using shallow parsing in argument classification of Persian verbs
Author :
Saeedi, Parisa ; Faili, Hesham
Author_Institution :
ECE Dept., Univ. of Tehran, Tehran, Iran
fYear :
2012
fDate :
2-3 May 2012
Firstpage :
333
Lastpage :
338
Abstract :
Identifying the verb´s dependents and determining the semantic role for them is a natural pre-processing step in applications such as machine translation (MT) and question answering (QA). In this paper, we present a feature set for assigning argument instances into thematic role classes such as “Agent” and “Patient”. This feature set contains mainly language specific features for syntactic segments (chunks) of Persian sentences which can be categorized into three feature types including verb properties, chunk content and relation between the argument and verb of a sentence. We train an instance-based classifier on our manually annotated dataset to select the appropriate semantic role of each chunk. The classifier discriminates the best semantic role without considering the interaction between chunks in a sentence. The results show that our feature set discriminates the thematic roles of arguments in a considerable accuracy about 81.9% which enhances the baseline accuracy about 18.8%. Our dataset is free release and available for the researchers.
Keywords :
feature extraction; grammars; language translation; natural language processing; pattern classification; question answering (information retrieval); text analysis; Persian sentence chunks; Persian verb; argument classification; argument instance assignment; argument thematic role; chunk content; chunk semantic role; feature engineering; feature set; feature type; instance-based classifier; language specific feature; machine translation; question answering; sentence chunk interaction; shallow parsing; syntactic segments; verb dependent identification; verb property; Compounds; Educational institutions; Error analysis; Feature extraction; Semantics; Syntactics; Training data; Persian; Semantic Role Labeling; argument classification; feature set; shallow syntactic parsing; valency verb lexicon;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium on
Conference_Location :
Shiraz, Fars
Print_ISBN :
978-1-4673-1478-7
Type :
conf
DOI :
10.1109/AISP.2012.6313768
Filename :
6313768
Link To Document :
بازگشت