DocumentCode :
3580548
Title :
Annotating Indirect Anaphora for Hindi: A Corpus Based Study
Author :
Singh, Pardeep ; Dutta, Kamlesh
Author_Institution :
Comput. Sci. & Eng., Nat. Inst. of Technol., Hamirpur, India
fYear :
2014
Firstpage :
525
Lastpage :
529
Abstract :
Natural language processing requires a lot of analysis and information regarding words and segment of sentence. Almost all NLP applications such as machine translation, information extraction, automatic summarization, question answering system, natural language generation, etc., require successful identification and resolution of anaphora. Information regarding word using POS tagger, parser and other tool can be gathered. Hindi is language of free word order as compare to English. This enforces additional constraints on different NLP task. In this working paper we present an analysis of Hindi genre. We used ten tags from literature. Out of ten tags seven are annotated using Botley´s annotation scheme manually. We annotated 1540 demonstrative pronoun from twelve files of EMILEE corpus. Input file is EMILEE file and output is fully annotated unicode file.
Keywords :
grammars; natural language processing; Botley annotation scheme; Hindi; NLP application; POS tagger; anaphora resolution; natural language processing; parser; Computational linguistics; Feature extraction; Pragmatics; Semantics; Support vector machines; Syntactics; Tagging; anaphora resolution; annotation; case marker; natural language processing; semantic category;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Communication Networks (CICN), 2014 International Conference on
Print_ISBN :
978-1-4799-6928-9
Type :
conf
DOI :
10.1109/CICN.2014.120
Filename :
7065540
Link To Document :
بازگشت