• DocumentCode
    1968765
  • Title

    Classification of vowel sounds using MFCC and feed forward Neural Network

  • Author

    Paulraj, M.P. ; Yaacob, Sazali Bin ; Nazri, Ahamad ; Kumar, Sathees

  • Author_Institution
    Sch. of Mechatron. Eng., Univ. Malaysia Perlis, Perlis
  • fYear
    2009
  • fDate
    6-8 March 2009
  • Firstpage
    59
  • Lastpage
    62
  • Abstract
    The English language as spoken by Malaysians varies from place to place and differs from one ethnic community and its sub-group to another. Hence, it is necessary to develop an exclusive speech to text translation system for understanding the English pronunciation as spoken by Malaysians. Speech translation is a process of both speech recognition and equivalent phonemic to word translation. Speech recognition is a process of identifying phonemes from the speech segment. In this paper, the initial step for speech recognition by identifying the phoneme features is proposed. In order to classify the phoneme features, Mel-frequency cepstral coefficients (MFCC) are computed in this paper. A simple feed forward neural network (FFNN) trained by back propagation procedure is proposed for identifying the phonemes features. The extracted MFCC coefficients are used as input to a neural network classifier for associating it to one of the 11 classes.
  • Keywords
    acoustic signal processing; backpropagation; cepstral analysis; feature extraction; feedforward neural nets; signal classification; speech processing; speech recognition; English language; English pronunciation; FFNN classifier; MFCC coefficient; Mel-frequency cepstral coefficient; back propagation procedure; equivalent phonemic; feed forward neural network; phoneme feature classification; speech recognition; speech segment; speech-to-text translation system; vowel sound classification; word translation; Feature extraction; Feedforward neural networks; Feeds; Filters; Hidden Markov models; Mel frequency cepstral coefficient; Neural networks; Speech processing; Speech recognition; Tongue; Digital signal processing; Mel-frequency cepstrsal coefficients; Phonemes; Speech to text translation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing & Its Applications, 2009. CSPA 2009. 5th International Colloquium on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4244-4151-8
  • Electronic_ISBN
    978-1-4244-4152-5
  • Type

    conf

  • DOI
    10.1109/CSPA.2009.5069189
  • Filename
    5069189