• DocumentCode
    536050
  • Title

    Combining Support Vector Machines, Border Revised Rules and Transformation-based Error-driven Learning for Chinese Chunking

  • Author

    Wei Yuan ; Zhang Ling-yu ; Zhang Ya-xuan ; He Lu ; Fang Ding-yi

  • Author_Institution
    Sch. of Inf. Sci. & Technol., Northwest Univ., Xi´an, China
  • Volume
    1
  • fYear
    2010
  • fDate
    23-24 Oct. 2010
  • Firstpage
    383
  • Lastpage
    387
  • Abstract
    In research work, we found that grammatical information in the Modern Chinese Grammar Information Dictionary is very effective to revise chunk border. So the Modern Chinese Grammar Information Dictionary used to extract the chunk Border Revised Rules (BRR). In this paper, a new method of chunking is proposed--combined with BRR and TBL, SVM used for chunking. We reduced the number of SVM feature vector, and use SVM for chunking. Since the most chunk border error can be corrected by BRR, we reduced the number of feature vectors to shorten the training and chunking time of SVM. Finally, the data should be trained and modified again by TBL to obtain further the accuracy and the recall rate of improvement. We compare our method with method that combined with SVM and TBL. The experimental results show the method improves the precision and recall rates, while also reducing working hours.
  • Keywords
    dictionaries; grammars; learning (artificial intelligence); natural languages; support vector machines; Chinese chunking; border revised rule; feature vector; grammatical information; modern Chinese grammar information dictionary; support vector machine; transformation-based error driven learning; Accuracy; Dictionaries; Grammar; Speech; Support vector machines; Tagging; Training; Chunking; Dictionary of Modern Chinese Grammar information; SVM; TBL;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
  • Conference_Location
    Sanya
  • Print_ISBN
    978-1-4244-8432-4
  • Type

    conf

  • DOI
    10.1109/AICI.2010.87
  • Filename
    5656385