DocumentCode
782242
Title
A comparison of standard spell checking algorithms and a novel binary neural approach
Author
Hodge, Victoria J. ; Austin, Jim
Author_Institution
Dept. of Comput. Sci., York Univ., UK
Volume
15
Issue
5
fYear
2003
Firstpage
1073
Lastpage
1081
Abstract
In this paper, we propose a simple, flexible, and efficient hybrid spell checking methodology based upon phonetic matching, supervised learning, and associative matching in the AURA neural system. We integrate Hamming Distance and n-gram algorithms that have high recall for typing errors and a phonetic spell-checking algorithm in a single novel architecture. Our approach is suitable for any spell checking application though aimed toward isolated word error correction, particularly spell checking user queries in a search engine. We use a novel scoring scheme to integrate the retrieved words from each spelling approach and calculate an overall score for each matched word. From the overall scores, we can rank the possible matches. We evaluate our approach against several benchmark spellchecking algorithms for recall accuracy. Our proposed hybrid methodology has the highest recall rate of the techniques evaluated. The method has a high recall rate and low-computational cost.
Keywords
learning (artificial intelligence); neural nets; pattern matching; spelling aids; AURA neural system; Hamming Distance; associative matching; benchmark spellchecking algorithms; binary neural approach; isolated word error correction; n-gram algorithms; phonetic matching; phonetic spell-checking algorithm; recall accuracy; recall rate; scoring scheme; search engine; spell checking application; standard spell checking algorithms; supervised learning; user queries; Costs; Error correction; Hamming distance; Humans; Internet; Neural networks; Robustness; Search engines; Supervised learning;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2003.1232265
Filename
1232265
Link To Document