DocumentCode :
3381378
Title :
Applying machine learning to identify Chinese discourse markers
Author :
Tsou, Benjamin K. ; Gao, Weijun ; Lai, Tom B Y ; Chan, Samuel W K
Author_Institution :
Language Inf. Sci. Res. Center, City Univ. of Hong Kong, Hong Kong
fYear :
1999
fDate :
1999
Firstpage :
548
Lastpage :
553
Abstract :
With their high occurrence rates in argumentative Chinese texts, discourse markers play a significant role in the automatic processing of these kinds of Chinese texts, such as automatic summarization. The paper reports on an effort in applying machine learning to identify discourse markers in Chinese. We have processed 80 Chinese texts from which we have selected subsets for data training and data testing. We used C4.5 in our experiments and obtained accuracies of the order of 80%. Accuracies obtained by neural network are a bit worse than that of C4.5. We also interpret and analyze our experimental results in the linguistic perspective
Keywords :
learning (artificial intelligence); linguistics; natural languages; neural nets; text analysis; Chinese discourse marker identification; argumentative Chinese texts; automatic processing; automatic summarization; data testing; data training; linguistic perspective; machine learning; neural network; Machine learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Intelligence and Systems, 1999. Proceedings. 1999 International Conference on
Conference_Location :
Bethesda, MD
Print_ISBN :
0-7695-0446-9
Type :
conf
DOI :
10.1109/ICIIS.1999.810345
Filename :
810345
Link To Document :
بازگشت