• DocumentCode
    3208830
  • Title

    On recognition of spoken Bengali numerals

  • Author

    Ghanty, Sumit Kumar ; Shaikh, Soharab Hossain ; Chaki, Nabendu

  • Author_Institution
    A.K Choudhury Sch. of Inf. Technol., Univ. of Calcutta, Kolkata, India
  • fYear
    2010
  • fDate
    8-10 Oct. 2010
  • Firstpage
    54
  • Lastpage
    59
  • Abstract
    This paper presents a method for recognizing isolated spoken Bengali numerals. Noisy audio samples have been considered as input in this study. Mel frequency cepstral coefficients (MFCC) have been used for extraction of feature from the audio samples. Vector quantization is applied to reduce the dimension of the feature vectors and to generate a vector codebook for the numerals. The classification is based on the dynamic time warping (DTW) and a minimum distance classifier based on Euclidean distance measure. Both the speaker dependent and speaker independent situations have been considered for checking accuracy. Results show the limitations of MFCC based standard speech processing approach in speaker independent spoken digit recognition scenario in the presence of noise.
  • Keywords
    cepstral analysis; feature extraction; natural language processing; speaker recognition; speech coding; time warp simulation; vector quantisation; Euclidean distance; Mel frequency cepstral coefficient; distance classifier; dynamic time warping; feature extraction; noisy audio sample; speaker Recognition; speech processing; spoken Bengali numeral; vector codebook; vector quantization; Classification algorithms; Mel frequency cepstral coefficient; Noise; Speech; Speech recognition; Support vector machine classification; Vector quantization; Euclidean distance classifier; Mel frequency cepstral coefficients; Spoken numeral recognition; dynamic time warping; vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Information Systems and Industrial Management Applications (CISIM), 2010 International Conference on
  • Conference_Location
    Krackow
  • Print_ISBN
    978-1-4244-7817-0
  • Type

    conf

  • DOI
    10.1109/CISIM.2010.5643692
  • Filename
    5643692