Title :
A Grammatically Structured Noun Phrase Extractor for Vietnamese
Author :
Tuan-nguyen, Hoai-Duc ; Ho, Bao-Quoc ; Bui, Tuan-Dung ; Hoang, Minh-Chau
Author_Institution :
Dept. of IS, Univ. of Natural Sci., Ho Chi Minh City, Vietnam
fDate :
Feb. 27 2012-March 1 2012
Abstract :
Noun phrase (NP) extraction is a vital part of any Natural Language Processing (NLP) system. However, it would be much better if the system can also parse the grammar structure of the extracted NPs. Grammatically structured NP (GSNP) is helpful in many research fields (Conceptual Indexing, Syntactic variant generating, Nested NP identifying, etc). This paper introduces a system that extracts NPs from Vietnamese Documents and parses each NP into a tree representing its grammar structure. These trees, in one hand, can be saved as XML documents, and in the other hand, can be loaded from these XML documents by some particular Java classes.
Keywords :
XML; grammars; natural language processing; tree data structures; GSNP extraction; Java class; NLP system; Vietnamese document; XML document; grammar structure; grammatically structured noun phrase extractor; natural language processing; Educational institutions; Grammar; Learning systems; Measurement; Natural language processing; Tagging; Training data;
Conference_Titel :
Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2012 IEEE RIVF International Conference on
Conference_Location :
Ho Chi Minh City
Print_ISBN :
978-1-4673-0307-1
DOI :
10.1109/rivf.2012.6169837