Title :
Spelling error detector rule for Jawi stemmer
Author :
Sulaiman, Suliana ; Omar, Khairuddin ; Omar, Nazlia ; Murah, Zamri ; Rahman, H.A.
Author_Institution :
Pattern Recognition Res. Lab., Univ. Kebangsaan Malaysia, Bangi, Malaysia
Abstract :
Stemmer is important especially for information and document retrieval. It can also help to reduce the size of the dictionary. Normally Malay stemmers need to have a root word dictionary to increase the stemmer´s accuracy. In Jawi stemmer, we use Jawi spelling error rule to detect whether the program produces the correct stemmed word after all possible affixes have been removed. Jawi spelling error rule has been tested using 3018 data in Jawi with two syllables root word and the result was compared manually. The result shows 97.8% accuracy of Jawi spelling word with two syllables which have been checked correctly using the `spelling error detector rule´.
Keywords :
dictionaries; information retrieval; Jawi stemmer; Malay stemmers; document retrieval; information retrieval; root word dictionary; spelling error detector rule; Accuracy; Computational linguistics; Detectors; Dictionaries; Inference algorithms; Information science; Morphology; Jawi; Spelling error detector rule;
Conference_Titel :
Pattern Analysis and Intelligent Robotics (ICPAIR), 2011 International Conference on
Conference_Location :
Putrajaya
Print_ISBN :
978-1-61284-407-7
DOI :
10.1109/ICPAIR.2011.5976915