Title :
Study of the word segmentation algorithm based on Hash dictionary mechanism
Author :
Jun, Qin ; Ping, Zong ; Xi, Lu
Abstract :
Machine learning for data analysis and processing on the Internet allows users to access information quickly and conveniently. Because most of this information is text, automatic segmentation technology is of great significance. The word segmentation dictionary is an important component of a Chinese automatic word segmentation system, and the speed of dictionary loading and querying directly affects the speed of the segmentation system. This paper proposes an improved word segmentation mechanism based on double-word hashing. Test results show that the improved word segmentation algorithm increases query speed and the efficiency of term matching.
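The abstract does not spell out the data structure, so the following is only a minimal sketch of one common reading of a double-word hash dictionary: the first character of a word selects a first-level hash bucket, the second character selects a second-level bucket, and the rest of the word is stored as a tail. The class name DoubleCharHashDictionary, the function segment, the use of forward maximum matching, and the sample words are all assumptions made for illustration; they are not taken from the paper.

```python
class DoubleCharHashDictionary:
    """Hypothetical two-level hash index keyed on the first two characters."""

    def __init__(self, words):
        self.index = {}        # first char -> {second char -> set of tails}
        self.single = set()    # one-character words
        self.max_len = 1       # longest word length, bounds the matching window
        for word in words:
            if not word:
                continue
            if len(word) == 1:
                self.single.add(word)
                continue
            second_level = self.index.setdefault(word[0], {})
            second_level.setdefault(word[1], set()).add(word[2:])
            self.max_len = max(self.max_len, len(word))

    def contains(self, word):
        """Look up a word with at most two hash probes plus one set test."""
        if len(word) == 1:
            return word in self.single
        if len(word) < 2:
            return False
        second_level = self.index.get(word[0])
        if second_level is None:
            return False
        tails = second_level.get(word[1])
        return tails is not None and word[2:] in tails


def segment(text, dictionary):
    """Forward maximum matching (an assumed strategy) over the dictionary."""
    tokens = []
    i = 0
    while i < len(text):
        window = min(dictionary.max_len, len(text) - i)
        # Try the longest candidate first, down to two characters.
        for length in range(window, 1, -1):
            candidate = text[i:i + length]
            if dictionary.contains(candidate):
                tokens.append(candidate)
                i += length
                break
        else:
            # No multi-character word matched: emit a single character.
            tokens.append(text[i])
            i += 1
    return tokens
```

Under these assumptions, a call such as segment("研究生命起源", DoubleCharHashDictionary(["研究", "研究生", "生命", "起源"])) rejects most candidates after the first or second probe, which is the kind of query saving a two-level index is meant to provide. Whether the paper's mechanism stores tails in this way or matches in this direction is not stated in the abstract.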
Keywords :
Internet; data analysis; file organisation; learning (artificial intelligence); natural language processing; query processing; Chinese automatic word segmentation; automatic segmentation technology; double word hash; hash dictionary mechanism; machine learning; query speed; term matches; word segmentation algorithm; word segmentation dictionary; Algorithm design and analysis; Decision support systems; Dictionaries; Indexing; Loading; Organizations; Artificial Intelligence; Chinese Word Segmentation; Dictionary Mechanism; Double Layer Hash Indexing
Conference_Title :
Consumer Electronics, Communications and Networks (CECNet), 2012 2nd International Conference on
Conference_Location :
Yichang
Print_ISBN :
978-1-4577-1414-6
DOI :
10.1109/CECNet.2012.6202296