Development and suitability of Indian languages speech database for building watson based ASR system

Author

Pandey, D. ; Mondal, Tanmoy ; Agrawal, S.S. ; Bangalore, S.

Author_Institution

KIIT Coll. of Eng., Gurgaon, India

fYear

2013

fDate

25-27 Nov. 2013

Firstpage

1

Lastpage

6

Abstract

In this paper, we discuss our efforts in the development of Indian spoken languages corpora for building large vocabulary speech recognition systems using WATSON Toolkit. The current paper demonstrates that these corpora can be reduced to a varied degree for various phonemes by comparing the similarity among phonemes of different languages. We also discuss the design and methodology of collection of speech databases and the challenges we have faced during database creation. The experiments have been conducted on commonly known Indian languages, by training the ASR system with WATSON toolkit and evaluation by Sclite. The results for these experiments show that different Indian languages have a great similarity among their phoneme structures and phoneme sequences and we have exploited these features to create speech recognition system. Also, we have developed an algorithm to bootstrapping the phonemes of one particular language into another by mapping the phonemes of different languages. The performance of Hindi and Bangla ASR systems using these databases has been compared.

Keywords

audio databases; natural language processing; speech recognition; Bangla ASR systems; Hindi ASR systems; Indian language speech database; Indian spoken language corpora; Sclite; WATSON Toolkit; Watson based ASR system; phoneme bootstrapping; phoneme sequences; phoneme structures; vocabulary speech recognition systems; Accuracy; Acoustics; Data models; Databases; Hidden Markov models; Speech; Speech recognition; Indian Languages; Speech Recognition; Speech databases;

fLanguage

English

Publisher

ieee

Conference_Titel

Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference

Conference_Location

Gurgaon

Type

conf

DOI

10.1109/ICSDA.2013.6709861

Filename

6709861