DocumentCode
1955410
Title
Anaphora Resolution of Malay Text: Issues and Proposed Solution Model
Author
Noor, Noorhuzaimi Karimah Mohd ; Aziz, Mohd Juzaidin Ab ; Noah, Shahrul Azman ; Hamzah, Mohd Pouzi
Author_Institution
Fac. of Comput. Syst. & Software Eng., Univ. Malaysia Pahang, Kuantan, Malaysia
fYear
2010
fDate
28-30 Dec. 2010
Firstpage
174
Lastpage
177
Abstract
Anaphora resolution (AR) is a process to identify the appropriate antecedent with its anaphor which occur before the anaphor. AR able to improve most of the NLP applications such as question answering, short answer examination system and information extraction. Most of AR systems are deal with English language. Thus, in 1990´s the research on AR has been applied for other language, such as Arabic, Chinese, Hindi and Norwegian. There are however limited or no effort in dealing with Malay text. The AR systems for one language cannot be simply adapted to use in other languages. This is due to the fact that different languages have different set of rules relating to syntax and semantic to respective language. This paper proposed a model for resolving anaphora phenomena in Malay text. The model consists of three elements consisting of anaphora resolution process, syntactic knowledge process and semantic-world knowledge process. The elements are defined based on the observable fact occurring in Malay language.
Keywords
natural language processing; question answering (information retrieval); text analysis; Malay Text; Malay text; anaphora resolution process; information extraction; question answering; semantic-world knowledge process; short answer examination system; syntactic knowledge process; Animals; Computational linguistics; Computational modeling; Electronic mail; Humans; Semantics; Syntactics; Malay text; model of anaphora resolution; natural language processing; poor-knowledge anaphora resolution;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2010 International Conference on
Conference_Location
Harbin
Print_ISBN
978-1-4244-9063-9
Type
conf
DOI
10.1109/IALP.2010.80
Filename
5681607
Link To Document