DocumentCode
2486253
Title
Application of triphone clustering in acoustic modeling for continuous speech recognition in Bengali
Author
Banerjee, Pratyush ; Garg, Gaurav ; Mitra, Pabitra ; Basu, Anupam
Author_Institution
Commun. Empowerment Lab., IIT Kharagpur, Kharagpur
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
The performance of the acoustic models is highly reflective on the overall performance of any continuous speech recognition system. Hence generation of an accurate and robust acoustic model holds the key to satisfactory recognition performance. As phones are found to vary according to the position of occurrence within a particular word, context information is of prime importance in acoustic modeling of phonetic signals. In this paper we look at the effect of triphone-based acoustic modeling over monophone based acoustic models in the context of continuous speech recognition in Bengali. Keeping in mind the lack of training resources for triphone-based acoustic modeling in Bengali, we have also described herein, the method of generating triphone clusters using decision tree based techniques. These triphone clusters have then been used to generate tied-state triphone based acoustic models to be used in a continuous speech recognizer.
Keywords
acoustic signal detection; acoustic signal processing; decision trees; speech recognition; context information; continuous speech recognition system; continuous speech recognizer; decision tree; monophone based acoustic model; phones; phonetic signal; robust acoustic model; tied-state triphone based acoustic model; triphone clustering; triphone clusters; triphone-based acoustic modeling; Acoustic applications; Context modeling; Decision trees; Engines; Hidden Markov models; Mel frequency cepstral coefficient; Natural languages; Robustness; Signal generators; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761657
Filename
4761657
Link To Document