Title :
Research on DOP-based Chinese parsing
Author :
Zhang, Yue-jie ; Zhang, Tao ; Zhu, Jing-Bo ; Yao, Tian-Shun
Author_Institution :
Dept. of Comput. Sci. & Eng., Fudan Univ., Shanghai, China
Abstract :
This paper presents a Chinese parsing method which takes data-oriented parsing technique as the basic framework and utilizes the similarity-based probability estimate technique. Through the initial selection process, the fragment-combination forms of the input sentence are acquired on the constructed knowledge source including treebank, fragment-bank and fragment-combination-bank. Then by using the similarity-based probability estimate technique, the combination parsing process can be completed successfully. To prove the method efficiency, the knowledge source is constructed on the real-world Chinese corpus, and the other corpus is used as the test set. The experiment results show that every test parameter is satisfied.
Keywords :
data handling; estimation theory; grammars; natural languages; probability; Chinese corpus; DOP-based Chinese parsing; data-oriented parsing technique; fragment-combination-bank; similarity-based probability estimate technique; treebank; Computer science; Data engineering; Distributed computing; Finance; Humans; Information management; Information processing; Laboratories; Maximum likelihood estimation; Testing; Data-oriented parsing; fragment-bank; fragment-combination-bank; similarity estimate; treebank;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527609