Title :
Classifiers combination to arabic morphosyntactic disambiguation
Author :
Albared, Mohammed ; Omar, Nazlia ; Aziz, Mohd J Ab
Author_Institution :
Fac. of Inf. Sci. & Technol., Nat. Univ. of Malaysia, Bangi, Malaysia
Abstract :
Parts of speech tagging forms the important pre-processing step in many of the natural language processing applications like text summarization, question answering and information retrieval system. MorphoSyntactic disambiguation (part of speech tagging) is the process of classifying every word in a given context to its appropriate part of speech. In this paper, we first review all the supervised machine learning approaches that have been used in the part of speech tagging. Then we review all the Arabic works to compare and to confirm our need to develop an accurate and efficient Arabic morphosyntactic disambiguation system. Finally we propose a classifiers combination experimental framework for Arabic part of speech tagger in which three diverse probabilistic classifiers (hidden Markov, maximum entropy and transformation based learning) are combined using many different combination strategies to exploit their advantages.
Keywords :
learning (artificial intelligence); natural language processing; pattern classification; Arabic morphosyntactic disambiguation; classifiers combination; information retrieval system; natural language processing; parts of speech tagging; question answering; supervised machine learning approach; text summarization; three diverse probabilistic classifier; Entropy; Hidden Markov models; Informatics; Information retrieval; Information science; Machine learning; Natural language processing; Natural languages; Speech processing; Tagging; MorphoSyntactic disambiguation; machine learning; natural language processing;
Conference_Titel :
Electrical Engineering and Informatics, 2009. ICEEI '09. International Conference on
Conference_Location :
Selangor
Print_ISBN :
978-1-4244-4913-2
DOI :
10.1109/ICEEI.2009.5254797