• DocumentCode
    3300470
  • Title

    Automatic identification of non-anaphoric anaphora in spoken dialog

  • Author

    Fei, Zhongchao ; Huang, Xuanjing ; Weng, Fuliang

  • Author_Institution
    Dept. of Comput. Sci., Fudan Univ., Shanghai
  • fYear
    2008
  • fDate
    19-22 Oct. 2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Identification of non-anaphoric anaphora is an important step towards a full anaphora resolution. In this paper, we present an automatic identification approach for this task. In our work, some novel features are proposed, which are based on dependency grammars, surrounding words and their POS tags. All the features are automatically extracted using a part-of-speech (POS) tagger and a dependency parser. Our experiments are on a commonly available dialogue corpus, Trains-93. Several machine learning algorithms are used in the experiments, including CME, CRF and SVM. Results show that compared to the approaches used in the previous work, our algorithm is simpler and achieves a higher accuracy.
  • Keywords
    computational linguistics; text analysis; anaphora resolution; automatic nonanaphoric anaphora identification; part-of-speech tagger; spoken dialog; Classification tree analysis; Computer science; Decision trees; Feature extraction; Machine learning algorithms; Pattern matching; Statistics; Support vector machine classification; Support vector machines; Spoken dialog; anaphora resolution; non-anaphoric anaphora identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4515-8
  • Electronic_ISBN
    978-1-4244-2780-2
  • Type

    conf

  • DOI
    10.1109/NLPKE.2008.4906761
  • Filename
    4906761