• DocumentCode
    756852
  • Title

    Experiments with fast Fourier transform, linear predictive and cepstral coefficients in dysarthric speech recognition algorithms using hidden Markov model

  • Author

    Polur, Prasad D. ; Miller, Gerald E.

  • Author_Institution
    Dept. of Biomed. Eng., Virginia Commonwealth Univ., Richmond, VA, USA
  • Volume
    13
  • Issue
    4
  • fYear
    2005
  • Firstpage
    558
  • Lastpage
    561
  • Abstract
    In this study, a hidden Markov Model was constructed and conditions were investigated that would provide improved performance for a dysarthric speech (isolated word) recognition system. The speaker dependant system was intended to act as an assistive/control tool. A small size vocabulary spoken by three cerebral palsy subjects was chosen. Fast Fourier transform, linear predictive, and Mel frequency cepstral coefficients extracted from data provided training input to several whole-word hidden Markov model configurations. The effect of model structure, number of states, and frame rates were also investigated. It was noted that a 10-state ergodic model using 15 msec frames was better than other configurations. Furthermore, it was found that a Mel cepstrum based model outperformed a fast Fourier transform and linear prediction based model. The system offers effective and robust application as a rehabilitation and/or control tool to assist dysarthric motor impaired individuals.
  • Keywords
    cepstral analysis; fast Fourier transforms; hidden Markov models; patient rehabilitation; physiological models; speech recognition; Mel frequency cepstral coefficients; cerebral palsy subjects; dysarthric motor impaired individuals; dysarthric speech recognition; ergodic model; fast Fourier transform; hidden Markov model; linear prediction; rehabilitation; Birth disorders; Cepstral analysis; Control systems; Data mining; Fast Fourier transforms; Hidden Markov models; Mel frequency cepstral coefficient; Predictive models; Speech recognition; Vocabulary; Cerebral palsy; Mel frequency cepstral coefficients; dysarthric speech; fast Fourier coefficients; hidden Markov model; linear prediction coefficients; speech recognition; Algorithms; Artificial Intelligence; Cerebral Palsy; Dysarthria; Fourier Analysis; Humans; Linear Models; Markov Chains; Models, Biological; Pattern Recognition, Automated; Signal Processing, Computer-Assisted; Speech Recognition Software;
  • fLanguage
    English
  • Journal_Title
    Neural Systems and Rehabilitation Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1534-4320
  • Type

    jour

  • DOI
    10.1109/TNSRE.2005.856074
  • Filename
    1556613