Title of article :
An evaluation of retrieval effectiveness using spelling-correction and string-similarity matching methods on Malay texts
Author/Authors :
Zainab Abu Bakar، نويسنده , , Tengku Mohd T. Sembok، نويسنده , , Mohammed Yusoff، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2000
Pages :
16
From page :
691
To page :
706
Abstract :
This article evaluates the effectiveness of spelling-correction and string-similarity matching methods in retrieving similar words in a Malay dictionary associated with a set of query words. The spelling-correction techniques used are SPEEDCOP, Soundex, Davidson, Phonix, and Hartlib. Two dynamic-programming methods that measure longest common subsequence and editcost-distance are used. Several search combinations of query and dictionary words are performed in the experiments, the best being one that stems both query and dictionary words using an existing Malay stemming algorithm. The retrieval effectiveness (E) and retrieved and relevant (R&R) mean measures are calculated from weighted combination of recall and precision values. Results from these experiments are then compared with available digram, a string-similarity method. The best R&R and E results are given by using digram. Editcost-distances produce the best E results, and both dynamic-programming methods rank second in finding R&R mean measures.
Journal title :
Journal of the American Society for Information Science and Technology
Serial Year :
2000
Journal title :
Journal of the American Society for Information Science and Technology
Record number :
993039
Link To Document :
بازگشت