DocumentCode :
2910924
Title :
Developing Bengali Speech Corpus for Phone Recognizer Using Optimum Text Selection Technique
Author :
Mandal, Sandipan ; Das, Biswajit ; Mitra, Pabitra ; Basu, Anupam
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Kharagpur, India
fYear :
2011
fDate :
15-17 Nov. 2011
Firstpage :
268
Lastpage :
271
Abstract :
Speech corpus plays a key role in construction of automatic speech recognition (ASR), text-to-speech (TTS) synthesis and phone recognition (PR) system. PR system and ASR system are quite similar in functionality. The difference between these two is that for PR system the speech signal is converted to phonefootnote{smallest discrete segment of sound in uttered speech} text whereas for ASR system the speech signal is converted to word text. Speech corpus for PR system usually consists of a text corpus, recording data corresponding to the text corpus, phonetic representation of the text corpus and a pronunciation dictionary. Selecting optimum text from available text with balanced phone distribution is an important task for developing high quality PR system. In this paper, we describe our text selection technique and discuss the performance of phone recognition system.
Keywords :
speech recognition; speech synthesis; text analysis; ASR system; Bengali speech corpus; PR system; automatic speech recognition system; balanced phone distribution; phone recognition system; phone recognizer; phone-footnote text; phonetic representation; pronunciation dictionary; speech signal; text corpus; text selection technique; text-to-speech synthesis system; Accuracy; Computational modeling; Dictionaries; Hidden Markov models; Speech; Speech recognition; Text recognition; GMM; HMM; MFCC; phoneme; sphinx3; sphinxtrain;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1733-8
Type :
conf
DOI :
10.1109/IALP.2011.16
Filename :
6121518
Link To Document :
بازگشت