• DocumentCode
    1125804
  • Title

    A High Speed String Correction Method Using a Hierarchical File

  • Author

    Tanaka, Eiichi ; Kojima, Yurie

  • Author_Institution
    Department of Information Science, Faculty of Engineering, Utsunomiya University, Utsunomiay 321, Japan.
  • Issue
    6
  • fYear
    1987
  • Firstpage
    806
  • Lastpage
    815
  • Abstract
    This paper describes a high speed string correction method using a hierarchical file. After reviewing a string correction method based on the Levenshtein distance, a hierarchical file construction method is introduced. A multistage string correction method using this file is proposed. The lower bound of computational complexity is estimated, and it is shown that a multistage method using a special type of a hierarchical file can reduce computational labor greatly. The larger the number of strings considered is, the more efficient the method becomes. The results of computer simulations on 5374 phoneme sequences using two and three stage correction methods are stated. The condition for a multistage string correction method to obtain higher correction rates than an ordinary dictionary method is included.
  • Keywords
    Biology; Computational complexity; Computer simulation; Dictionaries; Error correction; Sequences; Speech processing; Speech recognition; Statistical analysis; Vocabulary; Deletion; Levenshtein; hierarchical file; insertion; speech recognition; string comparison; string edit; substitution;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.1987.4767987
  • Filename
    4767987