DocumentCode :
1908062
Title :
Recognizing Chinese Elementary Discourse Unit on Comma
Author :
Shengqin Xu ; Peifeng Li
Author_Institution :
Natural Language Process. Lab., Soochow Univ., Suzhou, China
fYear :
2013
fDate :
17-19 Aug. 2013
Firstpage :
3
Lastpage :
6
Abstract :
Element discourse unit (EDU) recognition is the primary task of discourse analysis. Chinese punctuation is viewed as a delimiter of elementary discourse units in Chinese. In this paper, we consider Chinese comma to be the boundary of the discourse units and also to anchor discourse relations between units separated by comma. We divide it into seven major types based on syntactic patterns and propose three different machine learning methods to automatically disambiguate the type of Chinese comma. The experimental results on Chinese Tree bank 6.0 show that our method outperforms the baseline.
Keywords :
grammars; learning (artificial intelligence); natural language processing; trees (mathematics); Chinese comma; Chinese elementary discourse unit; Chinese punctuation; Chinese tree bank 6.0; EDU recognition; discourse analysis; discourse relations; machine learning methods; syntactic patterns; Accuracy; Analytical models; Educational institutions; Grammar; IP networks; Natural language processing; Syntactics; Multi-classifier; chinese comma; elementary discourse units recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location :
Urumqi
Type :
conf
DOI :
10.1109/IALP.2013.8
Filename :
6645990
Link To Document :
بازگشت