Title :
A Tibetan and Uygur Sensitive Word Automatically Add System Based on Co-occurrence
Author :
Yan, Xiaodong ; Sun, Yuan ; Zhao, Xiaobing ; Yang, Guosheng
Author_Institution :
Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
Abstract :
In this paper, we first built a sensitive word vocabulary and classified the sensitive words. Then, in order to add Tibetan and Uygur sensitive words automatically, we adopted a method based on co-occurrence. Our system uses a simple algorithm to calculate the relevance of sensitive words. According to this relevance, new sensitive words are added to the vocabulary automatically.
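The abstract does not give the relevance formula, so the following is only a minimal sketch of how a co-occurrence-based vocabulary expansion might look, not the authors' algorithm. It assumes documents are already tokenized, and it scores each candidate word by the fraction of its documents that also contain a known sensitive word. The function names, the `threshold` and `min_docs` parameters, and the scoring rule itself are all illustrative assumptions.

```python
from collections import Counter


def cooccurrence_relevance(documents, seed_words):
    """Score each non-seed word by how often it co-occurs with a seed word.

    Assumed scoring rule (not taken from the paper):
    score(w) = (#docs containing w and at least one seed word) / (#docs containing w)
    Returns (scores, doc_freq), both keyed by candidate word.
    """
    seeds = set(seed_words)
    doc_freq = Counter()  # number of documents containing the word
    co_freq = Counter()   # number of those documents that also contain a seed word
    for tokens in documents:
        unique = set(tokens)
        has_seed = bool(unique & seeds)
        for word in unique - seeds:
            doc_freq[word] += 1
            if has_seed:
                co_freq[word] += 1
    scores = {word: co_freq[word] / doc_freq[word] for word in doc_freq}
    return scores, doc_freq


def expand_vocabulary(documents, seed_words, threshold=0.8, min_docs=2):
    """Return the seed vocabulary plus candidates whose score passes the threshold.

    threshold and min_docs are hypothetical tuning parameters; min_docs guards
    against promoting a word that was seen too rarely to be scored reliably.
    """
    scores, doc_freq = cooccurrence_relevance(documents, seed_words)
    new_words = {
        word
        for word, score in scores.items()
        if score >= threshold and doc_freq[word] >= min_docs
    }
    return set(seed_words) | new_words
```

Calling `expand_vocabulary(documents, seed_words)` on a list of token lists returns the enlarged vocabulary as a set. In practice a candidate list produced this way would likely be reviewed before being used for monitoring, since a purely document-level score also promotes frequent function words that appear alongside everything.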
Keywords :
natural language processing; Tibetan sensitive word automatically add system; Uygur sensitive word automatically add system; sensitive word vocabulary; word relevance; Correlation; Educational institutions; Indexes; Monitoring; Real-time systems; Sensitivity; Vocabulary; co-occurrence; sensitive words; vocabulary
Conference_Title :
Intelligent Networks and Intelligent Systems (ICINIS), 2012 Fifth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4673-3083-1
DOI :
10.1109/ICINIS.2012.87