Title :
Combining Support Vector Machines, Border Revised Rules and Transformation-based Error-driven Learning for Chinese Chunking
Author :
Wei Yuan ; Zhang Ling-yu ; Zhang Ya-xuan ; He Lu ; Fang Ding-yi
Author_Institution :
Sch. of Inf. Sci. & Technol., Northwest Univ., Xi´an, China
Abstract :
In research work, we found that grammatical information in the Modern Chinese Grammar Information Dictionary is very effective to revise chunk border. So the Modern Chinese Grammar Information Dictionary used to extract the chunk Border Revised Rules (BRR). In this paper, a new method of chunking is proposed--combined with BRR and TBL, SVM used for chunking. We reduced the number of SVM feature vector, and use SVM for chunking. Since the most chunk border error can be corrected by BRR, we reduced the number of feature vectors to shorten the training and chunking time of SVM. Finally, the data should be trained and modified again by TBL to obtain further the accuracy and the recall rate of improvement. We compare our method with method that combined with SVM and TBL. The experimental results show the method improves the precision and recall rates, while also reducing working hours.
Keywords :
dictionaries; grammars; learning (artificial intelligence); natural languages; support vector machines; Chinese chunking; border revised rule; feature vector; grammatical information; modern Chinese grammar information dictionary; support vector machine; transformation-based error driven learning; Accuracy; Dictionaries; Grammar; Speech; Support vector machines; Tagging; Training; Chunking; Dictionary of Modern Chinese Grammar information; SVM; TBL;
Conference_Titel :
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-8432-4
DOI :
10.1109/AICI.2010.87