DocumentCode
538080
Title
Building and using existing hunspell dictionaries and TEX hyphenators as finite-state automata
Author
Pirinen, Tommi A. ; Lindén, Krister
Author_Institution
Dept. of Modern Languages, Univ. of Helsinki, Helsinki, Finland
fYear
2010
fDate
18-20 Oct. 2010
Firstpage
477
Lastpage
484
Abstract
There are numerous formats for writing spell-checkers for open-source systems and there are many descriptions for languages written in these formats. Similarly, for word hyphenation by computer there are TEX rules for many languages. In this paper we demonstrate a method for converting these spell-checking lexicons and hyphenation rule sets into finite-state automata, and present a new finite-state based system for writer´s tools used in current open-source software such as Firefox, OpenOffice.org and enchant via the spell-checking library voikko.
Keywords
dictionaries; finite automata; public domain software; TEX hyphenators; finite-state automata; hunspell dictionaries; hyphenation rule sets; open source software; open source systems; spell checking lexicons; spell checking library voikko; word hyphenation; writing spell checkers; Automata; Context; Dictionaries; Encoding; Open source software; Transducers;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
Conference_Location
Wisla
ISSN
2157-5525
Print_ISBN
978-1-4244-6432-6
Type
conf
DOI
10.1109/IMCSIT.2010.5679949
Filename
5679949
Link To Document