Title :
A data structure using hashing and tries for efficient Chinese lexical access
Author :
Lam, Yat-Kin; Huo, Qiang
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ., China
Date :
29 Aug.-1 Sept. 2005
Abstract :
A lexicon is needed in many applications. Tries, hash tables, and their variants have previously been investigated for lexicon organization and lexical access. In this paper, we propose a new data structure that combines a hash table with tries to store a Chinese lexicon. The data structure supports efficient lexical access while requiring less memory than a trie lexicon. Experiments evaluate its performance on in-vocabulary lexical access, out-of-vocabulary word rejection, and substring matching, and the results confirm the effectiveness of the proposed approach.
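The abstract does not say how the hash table and the tries are combined, so the following Python sketch is only an illustration under one assumed layout, not the authors' design: a hash table keyed on a word's first character, with each entry pointing to a trie over the remaining characters. The names `TrieNode`, `HashTrieLexicon`, `contains`, and `matches_at` are hypothetical. The sketch covers the three operations the abstract evaluates: in-vocabulary access, out-of-vocabulary rejection, and substring matching.

```python
class TrieNode:
    """One trie node: child links keyed by character, plus an end-of-word flag."""
    __slots__ = ("children", "is_word")

    def __init__(self):
        self.children = {}
        self.is_word = False


class HashTrieLexicon:
    """Assumed hybrid layout: hash on the first character, trie for the rest."""

    def __init__(self, words=()):
        # first character -> root of the trie over the remaining characters
        self.heads = {}
        for word in words:
            self.add(word)

    def add(self, word):
        if not word:
            return
        node = self.heads.setdefault(word[0], TrieNode())
        for ch in word[1:]:
            node = node.children.setdefault(ch, TrieNode())
        node.is_word = True

    def contains(self, word):
        """True for in-vocabulary words; False rejects out-of-vocabulary ones."""
        if not word:
            return False
        node = self.heads.get(word[0])
        for ch in word[1:]:
            if node is None:
                return False
            node = node.children.get(ch)
        return node is not None and node.is_word

    def matches_at(self, text, start):
        """Return every lexicon word that begins at text[start]."""
        found = []
        if start >= len(text):
            return found
        node = self.heads.get(text[start])
        end = start + 1
        while node is not None:
            if node.is_word:
                found.append(text[start:end])
            if end >= len(text):
                break
            node = node.children.get(text[end])
            end += 1
        return found

    def substrings(self, text):
        """Substring matching: all (position, word) pairs found in text."""
        return [(i, w) for i in range(len(text)) for w in self.matches_at(text, i)]


lexicon = HashTrieLexicon(["中国", "中国人", "人民", "国"])
```

With this toy lexicon, `lexicon.contains("中国")` succeeds, `lexicon.contains("中")` is rejected because "中" is only a prefix of stored words, and `lexicon.substrings("中国人民")` lists every stored word occurring in the string. Hashing the first character replaces the widest level of a plain trie with a single table lookup, which is one plausible source of the memory saving the abstract reports; the paper itself may organize the structure differently.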
Keywords :
data structures; natural languages; vocabulary; word processing; Chinese lexical access; data structure; hash table; in-vocabulary lexical access; out-of-vocabulary word rejection; substring matching; tries; Application software; Computer science; Data structures; Handwriting recognition; Information retrieval; Natural languages; Optical character recognition software; Speech recognition; Text analysis;
Conference_Title :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
Print_ISBN :
0-7695-2420-6
DOI :
10.1109/ICDAR.2005.7