• DocumentCode
    672850
  • Title
    Development of Kannada speech corpus for prosodically guided phonetic search engine
  • Author
    Shridhara, M.V. ; Banahatti, Bapu K. ; Narthan, L. ; Karjigi, Veena ; Kumaraswamy, R.
  • Author_Institution
    Dept. of Electron. & Commun., Siddaganga Inst. of Technol., Tumkur, India
  • fYear
    2013
  • fDate
    25-27 Nov. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The development and availability of spoken language corpora in regional languages are of utmost importance for a multicultural and multilingual country like India. Regional bias, accent, unique style and the diversity associated with each geographical region and language have a significant effect on the performance of speech recognition and synthesis systems. This paper explains the collection of speech data in the Kannada language for a prosodically guided phonetic search engine and the issues involved in transcription. The speech corpus consists of data in three different contexts, namely read mode, conversation mode and extempore mode. A four-layered transcription, namely phonetic transcription using IPA symbols, syllabification, pitch marking and break marking, is done for the entire data. A baseline recognition system for the Kannada language is built using HTK for the data collected in the different modes, and the results are presented.
  • Keywords
    natural language processing; search engines; speech recognition; speech synthesis; IPA symbols; India; Kannada language; Kannada speech corpus; baseline recognition system; break marking; geographical language; geographical region; phonetic transcription; pitch marking; prosodically guided phonetic search engine; regional languages; speech recognition-synthesis systems; spoken language corpora; syllabification; Context; Databases; Engines; Search engines; Speech; Speech recognition; Vocabulary; Continuous speech recognition; Prosodically guided phonetic engine; Speech corpus; transcription;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)
  • Conference_Location
    Gurgaon
  • Type
    conf
  • DOI
    10.1109/ICSDA.2013.6709875
  • Filename
    6709875