DocumentCode :
1125804
Title :
A High Speed String Correction Method Using a Hierarchical File
Author :
Tanaka, Eiichi ; Kojima, Yurie
Author_Institution :
Department of Information Science, Faculty of Engineering, Utsunomiya University, Utsunomiya 321, Japan.
Issue :
6
fYear :
1987
Firstpage :
806
Lastpage :
815
Abstract :
This paper describes a high-speed string correction method using a hierarchical file. After reviewing a string correction method based on the Levenshtein distance, a hierarchical file construction method is introduced. A multistage string correction method using this file is proposed. The lower bound of computational complexity is estimated, and it is shown that a multistage method using a special type of hierarchical file can greatly reduce computational labor. The larger the number of strings considered, the more efficient the method becomes. The results of computer simulations on 5374 phoneme sequences using two- and three-stage correction methods are reported. The condition under which a multistage string correction method obtains higher correction rates than an ordinary dictionary method is included.
Keywords :
Biology; Computational complexity; Computer simulation; Dictionaries; Error correction; Sequences; Speech processing; Speech recognition; Statistical analysis; Vocabulary; Deletion; Levenshtein; hierarchical file; insertion; speech recognition; string comparison; string edit; substitution
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
IEEE
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.1987.4767987
Filename :
4767987