Title :
The Dictionary Mechanism for Chinese Word Segmentation -- Hashing the Initial and Termination of the Chinese Words
Author_Institution :
Math. & Inf. Technol., Hanshan Normal Univ., Chaozhou, China
Abstract :
For the purpose of improving the efficiency of Chinese word segmentation, this paper puted forward the method of using initial and termination to distinguish the Chinese words on the basis of analyzing Chinese word characteristics in Chinese dictionary. The result showed that efficiency is enhanced by adopting this mechanism.
Keywords :
dictionaries; text analysis; Chinese dictionary; Chinese word characteristics; Chinese word segmentation; hashing; Algorithm design and analysis; Arrays; Classification algorithms; Dictionaries; Indexing; Information processing; Text processing; distinction degree; hash conflict; segmentation algorithm; segmentation dictionary;
Conference_Titel :
Intelligent Systems (GCIS), 2010 Second WRI Global Congress on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-9247-3
DOI :
10.1109/GCIS.2010.193