Title :
New Word Recognition Based on Support Vector Machines and Constraints
Author :
Xu Yuanfang ; Gu Hui
Author_Institution :
Coll. of Network Technol., Inner Mongolia Normal Univ., Hohhot, China
Abstract :
This paper studies a new method for identifying the new words, Objective to identify new words better. Method is first to extract the positive and negative samples from training corpus which was handled by segmentation and POS Tagging according to the dictionary, then combining with all kinds of words classification which was gotten from training corpus, and gaining the new word support vector through the training of supporting vector machine. Word segmentation and POS Tagging on the test of corpus containing simulated new words, in conjunction with the relevant constraints and the slack variables are proposed to select candidate new words, as to the quantized input and support vector machine classifier calculate by combining with the word itself characteristics, getting the relevant results is compared with a threshold and getting new words. As the results, the radial basis function (RBF) when the new word identification system recall rate and correct rate of the optimal. Conclusion is through this method can improve the accuracy of word recognition and recall.
Keywords :
pattern classification; radial basis function networks; support vector machines; word processing; POS tagging; RBF; radial basis function; support vector machine classifier; support vector machines; word identification system; word recognition; words classification; Accuracy; Character recognition; Dictionaries; Kernel; Support vector machine classification; Training; New words recognition; constraints; kernel function; support vector machine;
Conference_Titel :
Information Science and Control Engineering (ICISCE), 2015 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-6849-0
DOI :
10.1109/ICISCE.2015.82