DocumentCode
3300470
Title
Automatic identification of non-anaphoric anaphora in spoken dialog
Author
Fei, Zhongchao ; Huang, Xuanjing ; Weng, Fuliang
Author_Institution
Dept. of Comput. Sci., Fudan Univ., Shanghai
fYear
2008
fDate
19-22 Oct. 2008
Firstpage
1
Lastpage
6
Abstract
Identification of non-anaphoric anaphora is an important step towards a full anaphora resolution. In this paper, we present an automatic identification approach for this task. In our work, some novel features are proposed, which are based on dependency grammars, surrounding words and their POS tags. All the features are automatically extracted using a part-of-speech (POS) tagger and a dependency parser. Our experiments are on a commonly available dialogue corpus, Trains-93. Several machine learning algorithms are used in the experiments, including CME, CRF and SVM. Results show that compared to the approaches used in the previous work, our algorithm is simpler and achieves a higher accuracy.
Keywords
computational linguistics; text analysis; anaphora resolution; automatic nonanaphoric anaphora identification; part-of-speech tagger; spoken dialog; Classification tree analysis; Computer science; Decision trees; Feature extraction; Machine learning algorithms; Pattern matching; Statistics; Support vector machine classification; Support vector machines; Spoken dialog; anaphora resolution; non-anaphoric anaphora identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-4515-8
Electronic_ISBN
978-1-4244-2780-2
Type
conf
DOI
10.1109/NLPKE.2008.4906761
Filename
4906761
Link To Document