• DocumentCode
    2352791
  • Title

    ABHIDHA: an extended WordNet for Indo Aryan languages

  • Author

    Annam, Shireesh Reddy ; Choudhury, Monojit ; Sudeshna Sarkar ; Basu, Anupam

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, India
  • fYear
    2003
  • fDate
    10-11 March 2003
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    A lexical knowledge base is an important component of any intelligent information processing system. The WordNet developed at the Cognitive Systems Laboratories at Princeton has served as a lexical reference system for natural language processing activities. The Indian language based activities at our institute mainly in text-to-speech synthesis and natural language generation from iconic inputs require the inclusion of additional features in the lexical reference system like phonology, word roots, and etymological information. Our initial efforts have been in Hindi and Bengali but commonality of Indo Aryan Languages and the importance of these extra features lead us to believe that it is a worthwhile effort to build-up a WordNet for other Indo Aryan languages containing these features. In this paper, we speak of the issues relating to the structured design and development of a generalized extended WordNet for Indo Aryan languages with special reference to Hindi and Bengali.
  • Keywords
    knowledge representation; language translation; natural languages; speech synthesis; text analysis; ABHIDHA; Bengali; Cognitive Systems Laboratories; Hindi; Indian language; Indo Aryan languages; WordNet; etymological information; iconic inputs; intelligent information processing system; lexical knowledge; lexical reference system; natural language generation; natural language processing; phonology; text-to-speech synthesis; word roots; Computer science; Dictionaries; Information processing; Intelligent systems; Knowledge engineering; Knowledge representation; Natural language processing; Natural languages; Speech; Thesauri;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research Issues in Data Engineering: Multi-lingual Information Management, 2003. RIDE-MLIM 2003. Proceedings. 13th International Workshop on
  • ISSN
    1066-1395
  • Print_ISBN
    0-7803-7868-7
  • Type

    conf

  • DOI
    10.1109/RIDE.2003.1249839
  • Filename
    1249839