Title :
A system for identification of idioms in Hindi
Author :
Priyanka ; Sinha, R.M.K.
Author_Institution :
Comput. Sci. & Eng., Noida, India
Abstract :
Idioms are extensively used in everyday language. They carry a metaphorical sense that makes their comprehension difficult as their meaning cannot be deduced from the meaning of their constituent parts. They pose a challenge for Natural language processing (NLP) applications like machine translation, information retrieval and question answering as their translation and meaning needs to be derived logically rather than literally. A lot of research work has been carried out into automatic extraction of multi-word expressions, but no comprehensive work has been reported on idioms in Hindi. In this paper, an attempt has been made to study the linguistic and morphological variations that are usually encountered in idioms in Hindi. Based on this study, a methodology for deriving rules for representation of idioms and their search has been developed. The rules representing the idioms are hand crafted. For the idiom identification, rule-base has been used to mark the input text for probable presence of idiom. Our system is limited to use only intra-sentential context. The experimental results demonstrate feasibility and scalability of our methodology.
Keywords :
language translation; natural language processing; question answering (information retrieval); Hindi; NLP; idiom identification; information retrieval; machine translation; metaphorical sense; multiword expressions; natural language processing; question answering; Arrays; Context; Data mining; Databases; Natural language processing; Semantics; Syntactics; Hindi; NLP; idiom variations; idioms;
Conference_Titel :
Contemporary Computing (IC3), 2014 Seventh International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-5172-7
DOI :
10.1109/IC3.2014.6897218