Title :
Building and using existing hunspell dictionaries and TeX hyphenators as finite-state automata
Author :
Pirinen, Tommi A. ; Lindén, Krister
Author_Institution :
Dept. of Modern Languages, Univ. of Helsinki, Helsinki, Finland
Abstract :
There are numerous formats for writing spell-checkers for open-source systems, and there are many descriptions for languages written in these formats. Similarly, for word hyphenation by computer, there are TeX rules for many languages. In this paper we demonstrate a method for converting these spell-checking lexicons and hyphenation rule sets into finite-state automata, and present a new finite-state based system for writer's tools used in current open-source software such as Firefox, OpenOffice.org and enchant via the spell-checking library voikko.
Keywords :
dictionaries; finite automata; public domain software; TeX hyphenators; finite-state automata; hunspell dictionaries; hyphenation rule sets; open source software; open source systems; spell checking lexicons; spell checking library voikko; word hyphenation; writing spell checkers; Automata; Context; Dictionaries; Encoding; Open source software; Transducers
Conference_Title :
Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
Conference_Location :
Wisla
Print_ISBN :
978-1-4244-6432-6
DOI :
10.1109/IMCSIT.2010.5679949