DocumentCode :
2699630
Title :
Optimization of dysarthric speech recognition
Author :
Chen, Fangxin ; Kostov, Aleksandar
Author_Institution :
Dept. of Linguistics, Alberta Univ., Edmonton, Alta., Canada
Volume :
4
fYear :
1997
fDate :
30 Oct-2 Nov 1997
Firstpage :
1436
Abstract :
Explores the residual vocal ability of people who have severe motor impairments accompanied with severe dysarthria, and develops methods for improving the performance of automatic speech recognition (ASR) of dysarthric speech. The target applications for this technology are in the development of communication and control devices for these people. In our speech recognition system, we developed an adaptive word detection algorithm to detect words in highly irregular dysarthric speech. We also implemented perceptually-based mel frequency cepstrum coefficients (MFCC) for the parametric representation of the speech signal, and we adopted the left-to-right discrete hidden Markov model (DHMM) for speech pattern recognition. The system was tested with one person who has cerebral palsy and dysarthria, reducing the intelligibility of her speech to less than 15%. Our initial results on a word set consisting of ten digits demonstrated that recognition rates above 90% can be achieved if more than ten repetitions are used for training
Keywords :
adaptive systems; cepstral analysis; handicapped aids; hidden Markov models; optimisation; speech intelligibility; speech recognition; 10-digit word set; adaptive word detection algorithm; automatic speech recognition performance optimization; cerebral palsy; communication devices; communication disorders; control devices; dysarthric speech recognition; irregular dysarthric speech; left-to-right discrete hidden Markov model; motor impairments; perceptually-based mel frequency cepstrum coefficients; recognition rates; repetitions; residual vocal ability; speech intelligibility; speech signal parametric representation; training; Automatic control; Automatic speech recognition; Cepstrum; Communication system control; Detection algorithms; Hidden Markov models; Mel frequency cepstral coefficient; Pattern recognition; Speech recognition; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, 1997. Proceedings of the 19th Annual International Conference of the IEEE
Conference_Location :
Chicago, IL
ISSN :
1094-687X
Print_ISBN :
0-7803-4262-3
Type :
conf
DOI :
10.1109/IEMBS.1997.756975
Filename :
756975
Link To Document :
بازگشت