Title :
Research on Root Node Finder in Chinese Long Sentences
Author :
Wang Hongsheng ; Li Yu´e ; Xiao Rui
Author_Institution :
Sch. of Inf. Sci. & Eng., Shenyang Univ. of Technol., Shenyang, China
Abstract :
Chinese dependency relationship is complex and dependency span between the words is large especially in Chinese long sentences. Considering greed problems caused by Arc-eager algorithm for solving the long-distance dependencies in long sentences, this paper constructs a Root Node Finder. It can divide a long sentence into two short sentences. Using HIT Dependency Tree bank as a training test set, this paper uses Arc-eager algorithm and machine learning for dependency analysis of the whole sentence. The results show that the root accuracy of syntactic analysis is 77.25%. Then experiment uses LIBSVM as a binary classifier and adds adding different features for the Root Node Finder. At last, it identifies the optimal combination of features to impact the Root Node Finder. Results show that optimal features added, the root accuracy is 93.05%.
Keywords :
learning (artificial intelligence); natural language processing; pattern classification; support vector machines; text analysis; Arc-eager algorithm; Chinese dependency relationship; Chinese long sentences; HIT Dependency Treebank; LIBSVM; Root Node Finder; binary classifier; greed problems; machine learning; root accuracy; short sentences; support vector machine; syntactic analysis; training test set; whole sentence dependency analysis; Accuracy; Algorithm design and analysis; Feature extraction; Speech; Support vector machines; Syntactics; Training; Chinese long sentences; Root Node Finder; dependency; syntactic analysis;
Conference_Titel :
Intelligent Networks and Intelligent Systems (ICINIS), 2013 6th International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-1-4799-2808-8
DOI :
10.1109/ICINIS.2013.8