DocumentCode :
538080
Title :
Building and using existing hunspell dictionaries and TeX hyphenators as finite-state automata
Author :
Pirinen, Tommi A. ; Lindén, Krister
Author_Institution :
Dept. of Modern Languages, Univ. of Helsinki, Helsinki, Finland
fYear :
2010
fDate :
18-20 Oct. 2010
Firstpage :
477
Lastpage :
484
Abstract :
There are numerous formats for writing spell-checkers for open-source systems, and there are many descriptions for languages written in these formats. Similarly, for word hyphenation by computer there are TeX rules for many languages. In this paper we demonstrate a method for converting these spell-checking lexicons and hyphenation rule sets into finite-state automata, and present a new finite-state based system for writer's tools used in current open-source software such as Firefox, OpenOffice.org and enchant via the spell-checking library voikko.
Keywords :
dictionaries; finite automata; public domain software; TeX hyphenators; finite-state automata; hunspell dictionaries; hyphenation rule sets; open source software; open source systems; spell checking lexicons; spell checking library voikko; word hyphenation; writing spell checkers; Automata; Context; Dictionaries; Encoding; Open source software; Transducers
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
Conference_Location :
Wisla
ISSN :
2157-5525
Print_ISBN :
978-1-4244-6432-6
Type :
conf
DOI :
10.1109/IMCSIT.2010.5679949
Filename :
5679949