Title :
Development and suitability of Indian languages speech database for building watson based ASR system
Author :
Pandey, D. ; Mondal, Tanmoy ; Agrawal, S.S. ; Bangalore, S.
Author_Institution :
KIIT Coll. of Eng., Gurgaon, India
Abstract :
In this paper, we discuss our efforts in the development of Indian spoken languages corpora for building large vocabulary speech recognition systems using WATSON Toolkit. The current paper demonstrates that these corpora can be reduced to a varied degree for various phonemes by comparing the similarity among phonemes of different languages. We also discuss the design and methodology of collection of speech databases and the challenges we have faced during database creation. The experiments have been conducted on commonly known Indian languages, by training the ASR system with WATSON toolkit and evaluation by Sclite. The results for these experiments show that different Indian languages have a great similarity among their phoneme structures and phoneme sequences and we have exploited these features to create speech recognition system. Also, we have developed an algorithm to bootstrapping the phonemes of one particular language into another by mapping the phonemes of different languages. The performance of Hindi and Bangla ASR systems using these databases has been compared.
Keywords :
audio databases; natural language processing; speech recognition; Bangla ASR systems; Hindi ASR systems; Indian language speech database; Indian spoken language corpora; Sclite; WATSON Toolkit; Watson based ASR system; phoneme bootstrapping; phoneme sequences; phoneme structures; vocabulary speech recognition systems; Accuracy; Acoustics; Data models; Databases; Hidden Markov models; Speech; Speech recognition; Indian Languages; Speech Recognition; Speech databases;
Conference_Titel :
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location :
Gurgaon
DOI :
10.1109/ICSDA.2013.6709861