DocumentCode :
2850896
Title :
Language Oriented Parsing Through Morphologically Closed Word Classes in Urdu
Author :
Rizvi, S. M Jafar ; Husssain, M. ; Qaiser, Naeem
Author_Institution :
Department of Computer & Information Sciences, Pakistan Institute of Engineering & Applied Sciences (PIEAS), Islamabad, Pakistan. JafarRizvi@Gmail.com
fYear :
2004
fDate :
30-31 Dec. 2004
Firstpage :
19
Lastpage :
24
Abstract :
To find correctness of the grammatical structure of a natural language sentence, unambiguous parse is the basic requirement. Therefore, Parsing of the source language plays a key role for reliable machine translation. In this paper a language oriented parsing algorithm is presented for Urdu language sentences by initiating tagging only for morphologically closed classes of words like postpositions, conjunctions, verb morphemes, etc. By utilizing linguistics features of these closed classes neighbor words are collected into chunks. The chunks are formed by applying grammar rules in order through ordered context free grammar. Finally, full parsing on chunks is achieved which have much lesser search space. The functional structures are unified throughout the process of chunking and parsing to support the correctness of parsing. It is found that use of closed classes ahead of open classes and chunking process reduces the number of grammar rules and enhances the reliability of final parse.
Keywords :
Chunking; Machine Translation; Open and Closed Classes of Words; Parsing; Urdu Morphology; Learning systems; Manuals; Morphology; Natural language processing; Natural languages; Reliability engineering; Tagging; White spaces; Chunking; Machine Translation; Open and Closed Classes of Words; Parsing; Urdu Morphology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering, Sciences and Technology, Student Conference On
Print_ISBN :
0-7803-8871-2
Type :
conf
DOI :
10.1109/SCONES.2004.1564762
Filename :
1564762
Link To Document :
بازگشت