DocumentCode :
3208830
Title :
On recognition of spoken Bengali numerals
Author :
Ghanty, Sumit Kumar ; Shaikh, Soharab Hossain ; Chaki, Nabendu
Author_Institution :
A.K Choudhury Sch. of Inf. Technol., Univ. of Calcutta, Kolkata, India
fYear :
2010
fDate :
8-10 Oct. 2010
Firstpage :
54
Lastpage :
59
Abstract :
This paper presents a method for recognizing isolated spoken Bengali numerals. Noisy audio samples have been considered as input in this study. Mel frequency cepstral coefficients (MFCC) have been used for extraction of feature from the audio samples. Vector quantization is applied to reduce the dimension of the feature vectors and to generate a vector codebook for the numerals. The classification is based on the dynamic time warping (DTW) and a minimum distance classifier based on Euclidean distance measure. Both the speaker dependent and speaker independent situations have been considered for checking accuracy. Results show the limitations of MFCC based standard speech processing approach in speaker independent spoken digit recognition scenario in the presence of noise.
Keywords :
cepstral analysis; feature extraction; natural language processing; speaker recognition; speech coding; time warp simulation; vector quantisation; Euclidean distance; Mel frequency cepstral coefficient; distance classifier; dynamic time warping; feature extraction; noisy audio sample; speaker Recognition; speech processing; spoken Bengali numeral; vector codebook; vector quantization; Classification algorithms; Mel frequency cepstral coefficient; Noise; Speech; Speech recognition; Support vector machine classification; Vector quantization; Euclidean distance classifier; Mel frequency cepstral coefficients; Spoken numeral recognition; dynamic time warping; vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Information Systems and Industrial Management Applications (CISIM), 2010 International Conference on
Conference_Location :
Krackow
Print_ISBN :
978-1-4244-7817-0
Type :
conf
DOI :
10.1109/CISIM.2010.5643692
Filename :
5643692
Link To Document :
بازگشت