• DocumentCode
    672836
  • Title

    Development and suitability of Indian languages speech database for building watson based ASR system

  • Author

    Pandey, D. ; Mondal, Tanmoy ; Agrawal, S.S. ; Bangalore, S.

  • Author_Institution
    KIIT Coll. of Eng., Gurgaon, India
  • fYear
    2013
  • fDate
    25-27 Nov. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper, we discuss our efforts in the development of Indian spoken languages corpora for building large vocabulary speech recognition systems using WATSON Toolkit. The current paper demonstrates that these corpora can be reduced to a varied degree for various phonemes by comparing the similarity among phonemes of different languages. We also discuss the design and methodology of collection of speech databases and the challenges we have faced during database creation. The experiments have been conducted on commonly known Indian languages, by training the ASR system with WATSON toolkit and evaluation by Sclite. The results for these experiments show that different Indian languages have a great similarity among their phoneme structures and phoneme sequences and we have exploited these features to create speech recognition system. Also, we have developed an algorithm to bootstrapping the phonemes of one particular language into another by mapping the phonemes of different languages. The performance of Hindi and Bangla ASR systems using these databases has been compared.
  • Keywords
    audio databases; natural language processing; speech recognition; Bangla ASR systems; Hindi ASR systems; Indian language speech database; Indian spoken language corpora; Sclite; WATSON Toolkit; Watson based ASR system; phoneme bootstrapping; phoneme sequences; phoneme structures; vocabulary speech recognition systems; Accuracy; Acoustics; Data models; Databases; Hidden Markov models; Speech; Speech recognition; Indian Languages; Speech Recognition; Speech databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
  • Conference_Location
    Gurgaon
  • Type

    conf

  • DOI
    10.1109/ICSDA.2013.6709861
  • Filename
    6709861