• DocumentCode
    573580
  • Title

    Feature engineering using shallow parsing in argument classification of Persian verbs

  • Author

    Saeedi, Parisa ; Faili, Hesham

  • Author_Institution
    ECE Dept., Univ. of Tehran, Tehran, Iran
  • fYear
    2012
  • fDate
    2-3 May 2012
  • Firstpage
    333
  • Lastpage
    338
  • Abstract
    Identifying the verb´s dependents and determining the semantic role for them is a natural pre-processing step in applications such as machine translation (MT) and question answering (QA). In this paper, we present a feature set for assigning argument instances into thematic role classes such as “Agent” and “Patient”. This feature set contains mainly language specific features for syntactic segments (chunks) of Persian sentences which can be categorized into three feature types including verb properties, chunk content and relation between the argument and verb of a sentence. We train an instance-based classifier on our manually annotated dataset to select the appropriate semantic role of each chunk. The classifier discriminates the best semantic role without considering the interaction between chunks in a sentence. The results show that our feature set discriminates the thematic roles of arguments in a considerable accuracy about 81.9% which enhances the baseline accuracy about 18.8%. Our dataset is free release and available for the researchers.
  • Keywords
    feature extraction; grammars; language translation; natural language processing; pattern classification; question answering (information retrieval); text analysis; Persian sentence chunks; Persian verb; argument classification; argument instance assignment; argument thematic role; chunk content; chunk semantic role; feature engineering; feature set; feature type; instance-based classifier; language specific feature; machine translation; question answering; sentence chunk interaction; shallow parsing; syntactic segments; verb dependent identification; verb property; Compounds; Educational institutions; Error analysis; Feature extraction; Semantics; Syntactics; Training data; Persian; Semantic Role Labeling; argument classification; feature set; shallow syntactic parsing; valency verb lexicon;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence and Signal Processing (AISP), 2012 16th CSI International Symposium on
  • Conference_Location
    Shiraz, Fars
  • Print_ISBN
    978-1-4673-1478-7
  • Type

    conf

  • DOI
    10.1109/AISP.2012.6313768
  • Filename
    6313768