DocumentCode :
3020907
Title :
A data structure using hashing and tries for efficient Chinese lexical access
Author :
LAM, Yat-Kin ; Huo, Qiang
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ., China
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
506
Abstract :
A lexicon is needed in many applications. In the past, different structures such as tries, hash tables and their variants have been investigated for lexicon organization and lexical access. In this paper, we propose a new data structure that combines the use of hash table and tries for storing a Chinese lexicon. The data structure facilitates an efficient lexical access yet requires less memory than that of a trie lexicon. Experiments are conducted to evaluate its performance for in-vocabulary lexical access, out-of-vocabulary word rejection, and substring matching. The effectiveness of the proposed approach is confirmed.
Keywords :
data structures; natural languages; vocabulary; word processing; Chinese lexical access; data structure; hash table; in-vocabulary lexical access; out-of-vocabulary word rejection; substring matching; tries; Application software; Computer science; Data structures; Handwriting recognition; Information retrieval; Natural languages; Optical character recognition software; Speech recognition; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.7
Filename :
1575597
Link To Document :
بازگشت