• DocumentCode
    1925697
  • Title

    An Example Based Approach for Parsing Natural Language Sentences

  • Author

    Chatterjee, Niladri ; Goyal, Shailly

  • Author_Institution
    Dept. of Mathe., Indian Inst. of Technol., New Delhi
  • fYear
    2007
  • fDate
    5-7 March 2007
  • Firstpage
    451
  • Lastpage
    457
  • Abstract
    Development of a parser from scratch is typically time-consuming and error prone. A more efficient approach may lie in adapting an example-based parsing scheme, provided an appropriate strategy is developed for knowledge elicitation from the example base. This paper proposes such a scheme for developing a Link Grammar based parser. The proposed scheme extracts knowledge from an already parsed example base in the form of "link information" and "phrase templates". For a given input sentence the parsing algorithm first extracts the possible links for the constituent words from the link dictionary. Then using the phrase templates and the possible links of the words, the sentence is parsed bottom-up. Algorithms have been developed to identify different phrases in the input sentence and to handle unknown words. The examples considered in this work are for English sentence. The proposed scheme uses very little language-specific information. Hence, the proposed algorithms can be adapted for other languages as well
  • Keywords
    grammars; knowledge acquisition; natural language processing; English sentence; Link Grammar based parser; example base; example-based parsing scheme; knowledge elicitation; knowledge extraction; language-specific information; link dictionary; link information; natural language sentence; phrase templates; Artificial intelligence; Computer applications; Data mining; Dictionaries; Joining processes; Mathematics; Natural language processing; Natural languages; Probability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing: Theory and Applications, 2007. ICCTA '07. International Conference on
  • Conference_Location
    Kolkata
  • Print_ISBN
    0-7695-2770-1
  • Type

    conf

  • DOI
    10.1109/ICCTA.2007.28
  • Filename
    4127411