• DocumentCode
    538080
  • Title

    Building and using existing hunspell dictionaries and TEX hyphenators as finite-state automata

  • Author

    Pirinen, Tommi A. ; Lindén, Krister

  • Author_Institution
    Dept. of Modern Languages, Univ. of Helsinki, Helsinki, Finland
  • fYear
    2010
  • fDate
    18-20 Oct. 2010
  • Firstpage
    477
  • Lastpage
    484
  • Abstract
    There are numerous formats for writing spell-checkers for open-source systems and there are many descriptions for languages written in these formats. Similarly, for word hyphenation by computer there are TEX rules for many languages. In this paper we demonstrate a method for converting these spell-checking lexicons and hyphenation rule sets into finite-state automata, and present a new finite-state based system for writer´s tools used in current open-source software such as Firefox, OpenOffice.org and enchant via the spell-checking library voikko.
  • Keywords
    dictionaries; finite automata; public domain software; TEX hyphenators; finite-state automata; hunspell dictionaries; hyphenation rule sets; open source software; open source systems; spell checking lexicons; spell checking library voikko; word hyphenation; writing spell checkers; Automata; Context; Dictionaries; Encoding; Open source software; Transducers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
  • Conference_Location
    Wisla
  • ISSN
    2157-5525
  • Print_ISBN
    978-1-4244-6432-6
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2010.5679949
  • Filename
    5679949