DocumentCode
2850896
Title
Language Oriented Parsing Through Morphologically Closed Word Classes in Urdu
Author
Rizvi, S. M Jafar ; Husssain, M. ; Qaiser, Naeem
Author_Institution
Department of Computer & Information Sciences, Pakistan Institute of Engineering & Applied Sciences (PIEAS), Islamabad, Pakistan. JafarRizvi@Gmail.com
fYear
2004
fDate
30-31 Dec. 2004
Firstpage
19
Lastpage
24
Abstract
To find correctness of the grammatical structure of a natural language sentence, unambiguous parse is the basic requirement. Therefore, Parsing of the source language plays a key role for reliable machine translation. In this paper a language oriented parsing algorithm is presented for Urdu language sentences by initiating tagging only for morphologically closed classes of words like postpositions, conjunctions, verb morphemes, etc. By utilizing linguistics features of these closed classes neighbor words are collected into chunks. The chunks are formed by applying grammar rules in order through ordered context free grammar. Finally, full parsing on chunks is achieved which have much lesser search space. The functional structures are unified throughout the process of chunking and parsing to support the correctness of parsing. It is found that use of closed classes ahead of open classes and chunking process reduces the number of grammar rules and enhances the reliability of final parse.
Keywords
Chunking; Machine Translation; Open and Closed Classes of Words; Parsing; Urdu Morphology; Learning systems; Manuals; Morphology; Natural language processing; Natural languages; Reliability engineering; Tagging; White spaces; Chunking; Machine Translation; Open and Closed Classes of Words; Parsing; Urdu Morphology;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering, Sciences and Technology, Student Conference On
Print_ISBN
0-7803-8871-2
Type
conf
DOI
10.1109/SCONES.2004.1564762
Filename
1564762
Link To Document