Title :
Computing the Levenshtein distance of a regular language
Author :
Konstantinidis, Stavros
Author_Institution :
Dept. of Math. & Comput. Sci., Saint Mary´´s Univ., Halifax, NS, Canada
fDate :
29 Aug.-1 Sept. 2005
Abstract :
The edit distance (or Levenshtein distance) between two words is the smallest number of substitutions, insertions, and deletions of symbols that can be used to transform one of the words into the other. In this paper we consider the problem of computing the edit distance of a regular language (also known as constraint system), that is, the set of words accepted by a given finite automaton. This quantity is the smallest edit distance between any pair of distinct words of the language. We show that the problem is of polynomial time complexity. We distinguish two cases depending on whether the given automaton is deterministic or nondeterministic. In the latter case the time complexity is higher.
Keywords :
computational complexity; formal languages; Levenshtein distance; constraint system; edit distance; finite automaton; polynomial time complexity; regular language; Automata; Data communication; Dynamic programming; Error correction; Hamming distance; Information processing; Mathematics; Natural languages; Polynomials; Speech recognition;
Conference_Titel :
Information Theory Workshop, 2005 IEEE
Print_ISBN :
0-7803-9480-1
DOI :
10.1109/ITW.2005.1531868