Title :
Chinese text mining based on distributed SMO
Author :
Zhang, Yan ; Jiang, Mingyan ; Yuan, Dongfeng
Author_Institution :
Sch. of Inf. Sci. & Eng., Shandong Univ., Jinan, China
Abstract :
The paper presents the good classification accuracy of Support Vector Machine (SVM) with optimized parameters by particle swarm optimizer (PSO) in Chinese text classification. Through the simulation we also see that its training speed is slow when we deal with large amounts of texts in dataset, and it affects classification performance. Platt´s sequential minimal optimization (SMO) is one of the fastest algorithms for training SVMs, so we introduce the distributed SMO using multiple core processors to process the training data as a fast classification of large amounts of texts in dataset.
Keywords :
multiprocessing systems; natural language processing; particle swarm optimisation; pattern classification; support vector machines; text analysis; Chinese text classification; Chinese text mining; Platt sequential minimal optimization; SVM; distributed SMO; multiple core processors; support vector machine; Educational institutions; Support vector machines; SMO; SVM; distributed computing; text classification;
Conference_Titel :
Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-61284-485-5
DOI :
10.1109/ICCSN.2011.6014416