Title :
Automatic Classification of Uighur Web Pages
Author :
Xu Guixian ; Gao Xu ; Zhao Xiaobing ; Yang Guosheng
Author_Institution :
Coll. of Inf. Eng., Minzu Univ. of China, Beijing, China
Abstract :
In this paper, we introduce a classification approach for Uighur web pages. It utilizes the class feature dictionary and Cosine similarity computation to classify the Uighur web pages into the predefined classes rapidly and accurately. The experimental result shows that the approach has a good classification performance for Uighur web pages classification. It is useful and helpful for the construction of the statistical and rule-based classification of Uighur texts as well as construction of high-quality Uighur corpus.
Keywords :
Internet; knowledge based systems; natural language processing; pattern classification; statistical analysis; text analysis; Uighur Web page; Uighur corpus; Uighur text; Web page classification; class feature dictionary; classification performance; cosine similarity computation; rule-based classification; statistical classification; Dictionaries; Feature extraction; Information processing; Kernel; Text categorization; Web pages; Classification of Web Pages; Text classification; Uighur Information Processing;
Conference_Titel :
Intelligent System Design and Engineering Applications (ISDEA), 2013 Third International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4673-4893-5
DOI :
10.1109/ISDEA.2012.97