• DocumentCode
    3174469
  • Title

    Parametric feature-based voice recognition system using artificial neural network

  • Author

    Bodruzzaman, M. ; Kuah, K. ; Jamil, T. ; Wang, C. ; Li, X.

  • Author_Institution
    Dept. of Electr. Eng. , Tennessee State Univ., Nashville, TN, USA
  • fYear
    1993
  • fDate
    4-7 Apr 1993
  • Firstpage
    0.625
  • Abstract
    A speaker identification experiment was conducted using an artificial neural network. The speech data were collected from nine different speakers saying the same word, "hello". The speech data were then preprocessed for signal conditioning. Fourteen feature parameters were obtained: 12 of them are the coefficients of the 12th order linear predictor (LPC); and the other two were selected as the peak and bandwidth of the spectral envelope. These 14 feature parameters then served as the inputs to the neural network for speaker classification. A standard two-layer feedforward neural network was trained to identify different feature sets associated with the corresponding speakers. The network size was selected to be 14-8-4 (14 input, 8 hidden, and 4 output units). Nine utterances from each speaker were used as training data, and the other one served as testing data. The results showed that the trained network can correctly identify the speakers to an accuracy of 90%. The success rate could be increased by increasing the number of utterances per speaker
  • Keywords
    feedforward neural nets; linear predictive coding; multilayer perceptrons; speech coding; speech recognition; LPC; artificial neural network; bandwidth; feature parameters; feature-based voice recognition system; network size; peak; signal conditioning; speaker classification; speaker identification experiment; spectral envelope; speech data; testing data; trained network; training data; two-layer feedforward neural network; Artificial neural networks; Bandwidth; Difference equations; Feedforward neural networks; Linear predictive coding; Loudspeakers; Neural engineering; Neural networks; Random sequences; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Southeastcon '93, Proceedings., IEEE
  • Conference_Location
    Charlotte, NC
  • Print_ISBN
    0-7803-1257-0
  • Type

    conf

  • DOI
    10.1109/SECON.1993.465673
  • Filename
    465673