• DocumentCode
    3055612
  • Title

    Automatic identification of pronominal Anaphora in Turkish texts

  • Author

    Küçük, Dilek ; Yöndem, Meltem Turhan

  • Author_Institution
    Middle East Tech. Univ., Ankara
  • fYear
    2007
  • fDate
    7-9 Nov. 2007
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Anaphora identification is an important problem especially for its impact on anaphora and coreference resolution systems. In this paper, a system that automatically identifies anaphoric pronouns in Turkish is presented. The proposed system takes a decision tree learning approach, that of Quinlan´s C 4.5, where a corpus examination is carried out to determine linguistic features specific to Turkish which are to be used by the decision tree learner. The proposed system is significant especially for its ease of incorporation into any anaphora resolution system for Turkish. The system is evaluated on two different Turkish text samples and its performance on these samples is close to that of human identification.
  • Keywords
    decision trees; learning (artificial intelligence); natural language processing; text analysis; Turkish texts; anaphoric pronouns; automatic identification; coreference resolution systems; decision tree learning approach; linguistic features; pronominal anaphora; Data preprocessing; Decision trees; Humans; Information retrieval; Machine learning; Mars; Natural language processing; Natural languages; Usability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and information sciences, 2007. iscis 2007. 22nd international symposium on
  • Conference_Location
    Ankara
  • Print_ISBN
    978-1-4244-1363-8
  • Electronic_ISBN
    978-1-4244-1364-5
  • Type

    conf

  • DOI
    10.1109/ISCIS.2007.4456858
  • Filename
    4456858