DocumentCode :
1826497
Title :
Performance Analysis of Chinese Webpage Categorizing Algorithm Based on Support Vector Machines (SVM)
Author :
Gang, Xiao ; Jiancang, Xie
Author_Institution :
Sch. of Bus. Adm., Xi´´an Univ. of Technol., Xi´´an, China
Volume :
1
fYear :
2009
fDate :
18-20 Aug. 2009
Firstpage :
231
Lastpage :
235
Abstract :
Categorizing Web automatically for users is a key technique of information society, and the key point of this technique is Web training and categorization. This paper researches one of the important algorithm in this field-support vector machines (SVM). By analyzing and simulating 4 kinds of kernel function and 3 ways of feature selection, polynomial kernel function and document frequency is chosen for the best way in SVM algorithm. Meanwhile, pre-process algorithm is given in this paper in order to improve the efficiency of categorization. By simulation, importing pre-process method to SVM enhances the capability of the Web categorization both in precision and time-consumption.
Keywords :
Internet; classification; learning (artificial intelligence); support vector machines; Web training; chinese Webpage categorizing algorithm; document frequency; feature selection; information society; polynomial kernel function; support vector machine; time-consumption; Cleaning; Communications technology; Dictionaries; Information security; Kernel; Machine learning; Machine learning algorithms; Performance analysis; Support vector machines; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Assurance and Security, 2009. IAS '09. Fifth International Conference on
Conference_Location :
Xian
Print_ISBN :
978-0-7695-3744-3
Type :
conf
DOI :
10.1109/IAS.2009.316
Filename :
5284265
Link To Document :
بازگشت