DocumentCode
672869
Title
Issues in developing pronunciation lexicon for Marathi
Author
Bondale, Nandini ; Surve, Vrushali ; Nadkarni, Manasi ; Parkhi, Onkar ; Joshi, Pankaj ; Pandey, Ashutosh
Author_Institution
Sch. of Technol. & Comput. Sci., Tata Inst. of Fundamental Res., Mumbai, India
fYear
2013
fDate
25-27 Nov. 2013
Firstpage
1
Lastpage
4
Abstract
Language and speech resources are crucial for advancement of speech technology. Pronunciation lexicon is one among these. In this paper we describe our methodology of data collection for creating pronunciation lexicon for Marathi, an Indian language from an Indo-Aryan family. We mention in detail the issues which need to be dealt with as there is no one to one correspondence between the written forms and pronunciation. Selecting informants is equally important in creating pronunciation lexicon. We also list the issues which are directly related to informants´ speech as they need special attention while recording the data. The issues are resolved by using different methods of elicitation.
Keywords
natural language processing; speech processing; Indian language; Indo-Aryan family; Marathi; language resources; pronunciation lexicon development; speech resources; Data collection; Dictionaries; Educational institutions; Pragmatics; Presses; Speech; Standards; Indian language; Marathi; Pronunciation lexicon; W3C;
fLanguage
English
Publisher
ieee
Conference_Titel
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location
Gurgaon
Type
conf
DOI
10.1109/ICSDA.2013.6709894
Filename
6709894
Link To Document