Title of article :
Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysis of dysarthric (distorted) speech signals
Author/Authors :
Polur، نويسنده , , Prasad D. and Miller، نويسنده , , Gerald E.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2006
Pages :
8
From page :
741
To page :
748
Abstract :
Computer speech recognition of individuals with dysarthria, such as cerebral palsy patients requires a robust technique that can handle conditions of very high variability and limited training data. In this study, application of a 10 state ergodic hidden Markov model (HMM)/artificial neural network (ANN) hybrid structure for a dysarthric speech (isolated word) recognition system, intended to act as an assistive tool, was investigated. A small size vocabulary spoken by three cerebral palsy subjects was chosen. The effect of such a structure on the recognition rate of the system was investigated by comparing it with an ergodic hidden Markov model as a control tool. This was done in order to determine if this modified technique contributed to enhanced recognition of dysarthric speech. The speech was sampled at 11 kHz. Mel frequency cepstral coefficients were extracted from them using 15 ms frames and served as training input to the hybrid model setup. The subsequent results demonstrated that the hybrid model structure was quite robust in its ability to handle the large variability and non-conformity of dysarthric speech. The level of variability in input dysarthric speech patterns sometimes limits the reliability of the system. However, its application as a rehabilitation/control tool to assist dysarthric motor impaired individuals holds sufficient promise.
Keywords :
Ergodic model , Hidden Markov model , Artificial neural network , Mel frequency cepstral coefficients , Dysarthric speech , Cerebral palsy , speech recognition
Journal title :
Medical Engineering and Physics
Serial Year :
2006
Journal title :
Medical Engineering and Physics
Record number :
1729168
Link To Document :
بازگشت