Title :
Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov model
Author :
Polur, Prasad D. ; Miller, Gerald E.
Author_Institution :
Dept. of Biomed. Eng., Virginia Commonwealth Univ., Richmond, VA, USA
Abstract :
In this study, a hidden Markov Model was constructed and conditions were investigated that would provide improved performance for a dysarthric speech (isolated word) recognition system. The speaker dependant system was intended to act as an assistive/control tool. A small size vocabulary spoken by three cerebral palsy subjects was chosen. Fast Fourier transform, linear predictive, and Mel frequency cepstral coefficients extracted from data provided training input to several whole-word hidden Markov model configurations. The effect of model structure, number of states, and frame rates were also investigated. It was noted that a 10-state ergodic model using 15 msec frames was better than other configurations. Furthermore, it was found that a Mel cepstrum based model outperformed a fast Fourier transform and linear prediction based model. The system offers effective and robust application as a rehabilitation and/or control tool to assist dysarthric motor impaired individuals.
Keywords :
cepstral analysis; fast Fourier transforms; hidden Markov models; patient rehabilitation; physiological models; speech recognition; Mel frequency cepstral coefficients; cerebral palsy subjects; dysarthric motor impaired individuals; dysarthric speech recognition; ergodic model; fast Fourier transform; hidden Markov model; linear prediction; rehabilitation; Birth disorders; Cepstral analysis; Control systems; Data mining; Fast Fourier transforms; Hidden Markov models; Mel frequency cepstral coefficient; Predictive models; Speech recognition; Vocabulary; Cerebral palsy; Mel frequency cepstral coefficients; dysarthric speech; fast Fourier coefficients; hidden Markov model; linear prediction coefficients; speech recognition; Algorithms; Artificial Intelligence; Cerebral Palsy; Dysarthria; Fourier Analysis; Humans; Linear Models; Markov Chains; Models, Biological; Pattern Recognition, Automated; Signal Processing, Computer-Assisted; Speech Recognition Software;
Journal_Title :
Neural Systems and Rehabilitation Engineering, IEEE Transactions on
DOI :
10.1109/TNSRE.2005.856074