• DocumentCode
    2903574
  • Title

    Temporal Speech Normalization Methods Comparison in Speech Recognition Using Neural Network

  • Author

    Salam, Md Sah Bin Hj ; Mohamad, Dzulkifli ; Salleh, Sheikh Hussain Shaikh

  • Author_Institution
    Comp. Sci. & Info. Syst., Univ. Technol. Malaysia, Skudai, Malaysia
  • fYear
    2009
  • fDate
    4-7 Dec. 2009
  • Firstpage
    442
  • Lastpage
    447
  • Abstract
    Speech signal is temporally and acoustically varies. Recognition of speech by static input neural network requires temporal normalization of the speech to be equal to the number of input nodes of the NN while maintaining the properties of the speech. This paper compares three methods for speech temporal normalization namely the linear, extended linear and zero padded normalizations on isolated speech using different sets of learning parameters on multi layer perceptron neural network with adaptive learning. Although, previous work shows that linear normalization able to give high accuracy up to 95% on similar problem, the result in this experiment shows the opposite. The experimental result shows that zero padded normalization outperformed the two linear normalization methods using all the parameter sets tested. The highest recognition rate using zero padded normalization is 99% while linear and extended linear normalizations give only 74% and 76% respectively. This paper end before conclusion by comparing data used from previous work using linear normalization which gave high accuracy and the data used in this experiment which perform poorer.
  • Keywords
    neural nets; speech recognition; extended linear normalization; linear normalization; neural network; speech recognition; temporal speech normalization methods; zero padded normalization; Adaptive systems; Data mining; Feature extraction; Frequency; Neural networks; Pattern recognition; Spectral analysis; Speech analysis; Speech processing; Speech recognition; Adaptive Learning; Neural Network; Speech Recognition; Temporal Normalization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Pattern Recognition, 2009. SOCPAR '09. International Conference of
  • Conference_Location
    Malacca
  • Print_ISBN
    978-1-4244-5330-6
  • Electronic_ISBN
    978-0-7695-3879-2
  • Type

    conf

  • DOI
    10.1109/SoCPaR.2009.92
  • Filename
    5368649