DocumentCode
573580
Title
Feature engineering using shallow parsing in argument classification of Persian verbs
Author
Saeedi, Parisa ; Faili, Hesham
Author_Institution
ECE Dept., Univ. of Tehran, Tehran, Iran
fYear
2012
fDate
2-3 May 2012
Firstpage
333
Lastpage
338
Abstract
Identifying the verb´s dependents and determining the semantic role for them is a natural pre-processing step in applications such as machine translation (MT) and question answering (QA). In this paper, we present a feature set for assigning argument instances into thematic role classes such as “Agent” and “Patient”. This feature set contains mainly language specific features for syntactic segments (chunks) of Persian sentences which can be categorized into three feature types including verb properties, chunk content and relation between the argument and verb of a sentence. We train an instance-based classifier on our manually annotated dataset to select the appropriate semantic role of each chunk. The classifier discriminates the best semantic role without considering the interaction between chunks in a sentence. The results show that our feature set discriminates the thematic roles of arguments in a considerable accuracy about 81.9% which enhances the baseline accuracy about 18.8%. Our dataset is free release and available for the researchers.
Keywords
feature extraction; grammars; language translation; natural language processing; pattern classification; question answering (information retrieval); text analysis; Persian sentence chunks; Persian verb; argument classification; argument instance assignment; argument thematic role; chunk content; chunk semantic role; feature engineering; feature set; feature type; instance-based classifier; language specific feature; machine translation; question answering; sentence chunk interaction; shallow parsing; syntactic segments; verb dependent identification; verb property; Compounds; Educational institutions; Error analysis; Feature extraction; Semantics; Syntactics; Training data; Persian; Semantic Role Labeling; argument classification; feature set; shallow syntactic parsing; valency verb lexicon;
fLanguage
English
Publisher
ieee
Conference_Titel
Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium on
Conference_Location
Shiraz, Fars
Print_ISBN
978-1-4673-1478-7
Type
conf
DOI
10.1109/AISP.2012.6313768
Filename
6313768
Link To Document