• DocumentCode
    2176187
  • Title

    An efficient dictionary access method for morphological analysis

  • Author

    Ando, Kazuaki ; Tsuji, Takako ; Fuketa, Masao ; Aoe, Jun-Ichi

  • Author_Institution
    Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
  • Volume
    3
  • fYear
    1998
  • fDate
    11-14 Oct 1998
  • Firstpage
    2876
  • Abstract
    Proposes an efficient dictionary access method for morphological analysis of oriental languages by extending Aho and Corasick´s (1990) pattern matching machine. The proposed method is a simple and efficient algorithm to find all possible substrings in an input sentence and during a single pass. It stores the relations of grammatical connectivity of adjacent words into the output functions. Therefore, the costs of checking connections between the adjacent words can be reduced by using the connectivity relations. Furthermore, the method of constructing the grammatical connectivity relations is described. Finally, the proposed method is verified by theoretical analysis and an experimental estimation is supported by a computer simulation with a 100,000-word dictionary. From the simulation results, it turns out that the proposed method is 49.9% faster (in CPU time) than the traditional trie approach. In addition, the number of candidates for checking connections was 25.5% less than that of the original morphological analysis
  • Keywords
    dictionaries; digital simulation; linguistics; natural languages; string matching; CPU time; adjacent words; computer simulation; connection checking costs; dictionary access method; grammatical connectivity; input sentence substrings; morphological analysis; oriental languages; output functions; pattern matching machine; tries; AC machines; Computer simulation; Costs; Dictionaries; Information analysis; Information science; Intelligent systems; Machine intelligence; Natural language processing; Natural languages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
  • Conference_Location
    San Diego, CA
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-4778-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.1998.725099
  • Filename
    725099