• DocumentCode
    672869
  • Title

    Issues in developing pronunciation lexicon for Marathi

  • Author

    Bondale, Nandini ; Surve, Vrushali ; Nadkarni, Manasi ; Parkhi, Onkar ; Joshi, Pankaj ; Pandey, Ashutosh

  • Author_Institution
    Sch. of Technol. & Comput. Sci., Tata Inst. of Fundamental Res., Mumbai, India
  • fYear
    2013
  • fDate
    25-27 Nov. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Language and speech resources are crucial for advancement of speech technology. Pronunciation lexicon is one among these. In this paper we describe our methodology of data collection for creating pronunciation lexicon for Marathi, an Indian language from an Indo-Aryan family. We mention in detail the issues which need to be dealt with as there is no one to one correspondence between the written forms and pronunciation. Selecting informants is equally important in creating pronunciation lexicon. We also list the issues which are directly related to informants´ speech as they need special attention while recording the data. The issues are resolved by using different methods of elicitation.
  • Keywords
    natural language processing; speech processing; Indian language; Indo-Aryan family; Marathi; language resources; pronunciation lexicon development; speech resources; Data collection; Dictionaries; Educational institutions; Pragmatics; Presses; Speech; Standards; Indian language; Marathi; Pronunciation lexicon; W3C;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
  • Conference_Location
    Gurgaon
  • Type

    conf

  • DOI
    10.1109/ICSDA.2013.6709894
  • Filename
    6709894