DocumentCode :
2352791
Title :
ABHIDHA: an extended WordNet for Indo Aryan languages
Author :
Annam, Shireesh Reddy ; Choudhury, Monojit ; Sudeshna Sarkar ; Basu, Anupam
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, India
fYear :
2003
fDate :
10-11 March 2003
Firstpage :
1
Lastpage :
8
Abstract :
A lexical knowledge base is an important component of any intelligent information processing system. The WordNet developed at the Cognitive Systems Laboratories at Princeton has served as a lexical reference system for natural language processing activities. The Indian language based activities at our institute mainly in text-to-speech synthesis and natural language generation from iconic inputs require the inclusion of additional features in the lexical reference system like phonology, word roots, and etymological information. Our initial efforts have been in Hindi and Bengali but commonality of Indo Aryan Languages and the importance of these extra features lead us to believe that it is a worthwhile effort to build-up a WordNet for other Indo Aryan languages containing these features. In this paper, we speak of the issues relating to the structured design and development of a generalized extended WordNet for Indo Aryan languages with special reference to Hindi and Bengali.
Keywords :
knowledge representation; language translation; natural languages; speech synthesis; text analysis; ABHIDHA; Bengali; Cognitive Systems Laboratories; Hindi; Indian language; Indo Aryan languages; WordNet; etymological information; iconic inputs; intelligent information processing system; lexical knowledge; lexical reference system; natural language generation; natural language processing; phonology; text-to-speech synthesis; word roots; Computer science; Dictionaries; Information processing; Intelligent systems; Knowledge engineering; Knowledge representation; Natural language processing; Natural languages; Speech; Thesauri;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Research Issues in Data Engineering: Multi-lingual Information Management, 2003. RIDE-MLIM 2003. Proceedings. 13th International Workshop on
ISSN :
1066-1395
Print_ISBN :
0-7803-7868-7
Type :
conf
DOI :
10.1109/RIDE.2003.1249839
Filename :
1249839
Link To Document :
بازگشت