DocumentCode
2916780
Title
Large scale experiments on correction of confused words
Author
Huang, Jin Hu ; Powers, David
Author_Institution
Sch. of Inf. & Eng., Flinders Univ. of South Australia, Bedford Park, SA, Australia
fYear
2001
fDate
2001
Firstpage
77
Lastpage
82
Abstract
The paper describes a new approach to automatically learn contextual knowledge for spelling and grammar correction; we aim particularly to deal with cases where the words are all in the dictionary and so it is not obvious that there is an error. Traditional approaches are dictionary based, or use elementary tagging or partial parsing of the sentence to obtain context knowledge. Our approach uses affix information and only the most frequent words to reduce the complexity in terms of training time and running time for context-sensitive spelling correction. We build large scale confused word sets based on keyboard adjacency and apply our new approach to learn the contextual knowledge to detect and correct them. We explore the performance of auto-correction under conditions where significance and probability are set by the user
Keywords
grammars; linguistics; spelling aids; text analysis; affix information; auto-correction; automatic learning; confused word correction; context knowledge; context-sensitive spelling correction; contextual knowledge; elementary tagging; grammar correction; keyboard adjacency; large scale confused word sets; large scale experiments; most frequent words; partial parsing; probability; running time; spelling correction; training time; Dictionaries; Error analysis; Error correction; Frequency; Humans; Informatics; Keyboards; Knowledge engineering; Large-scale systems; Tagging;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science Conference, 2001. ACSC 2001. Proceedings. 24th Australasian
Conference_Location
Gold Coast, Qld.
ISSN
1530-0900
Print_ISBN
0-7695-0963-0
Type
conf
DOI
10.1109/ACSC.2001.906626
Filename
906626
Link To Document