DocumentCode :
2135258
Title :
Study of the word segmentation algorithm based on Hash dictionary mechanism
Author :
Jun, Qin ; Ping, Zong ; Xi, Lu
fYear :
2012
fDate :
21-23 April 2012
Firstpage :
3526
Lastpage :
3529
Abstract :
Machine learning of data analysis and processing in internet allows users to access information quickly and conveniently. As most of the information is text, so that automatic segmentation technology has great significance. The word segmentation dictionary is an important component of the Chinese automatic word segmentation system. The speed of the dictionary loading and query can affect the speed of segmentation system directly. This paper proposes an improved word segmentation mechanism based on double word Hash. The test result shows that the improved word segmentation algorithm enhances the query speed and efficiency of the term matches.
Keywords :
Internet; data analysis; file organisation; learning (artificial intelligence); natural language processing; query processing; Chinese automatic word segmentation; Internet; automatic segmentation technology; data analysis; double word hash; hash dictionary mechanism; machine learning; query speed; term matches; word segmentation algorithm; word segmentation dictionary; Algorithm design and analysis; Decision support systems; Dictionaries; Indexing; Internet; Loading; Organizations; Artificial Intelligence; Chinese Word Segmentation; Dictionary Mechanism; Double Layer Hash Indexing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Electronics, Communications and Networks (CECNet), 2012 2nd International Conference on
Conference_Location :
Yichang
Print_ISBN :
978-1-4577-1414-6
Type :
conf
DOI :
10.1109/CECNet.2012.6202296
Filename :
6202296
Link To Document :
بازگشت