Title :
Issues in developing pronunciation lexicon for Marathi
Author :
Bondale, Nandini ; Surve, Vrushali ; Nadkarni, Manasi ; Parkhi, Onkar ; Joshi, Pankaj ; Pandey, Ashutosh
Author_Institution :
Sch. of Technol. & Comput. Sci., Tata Inst. of Fundamental Res., Mumbai, India
Abstract :
Language and speech resources are crucial for advancement of speech technology. Pronunciation lexicon is one among these. In this paper we describe our methodology of data collection for creating pronunciation lexicon for Marathi, an Indian language from an Indo-Aryan family. We mention in detail the issues which need to be dealt with as there is no one to one correspondence between the written forms and pronunciation. Selecting informants is equally important in creating pronunciation lexicon. We also list the issues which are directly related to informants´ speech as they need special attention while recording the data. The issues are resolved by using different methods of elicitation.
Keywords :
natural language processing; speech processing; Indian language; Indo-Aryan family; Marathi; language resources; pronunciation lexicon development; speech resources; Data collection; Dictionaries; Educational institutions; Pragmatics; Presses; Speech; Standards; Indian language; Marathi; Pronunciation lexicon; W3C;
Conference_Titel :
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location :
Gurgaon
DOI :
10.1109/ICSDA.2013.6709894