DocumentCode :
395244
Title :
HMM-neural network monophone models for computer-based articulation training for the hearing impaired
Author :
Devarajan, Mukund ; Meng, Fansheng ; Hix, Penny ; Zahorian, Stephen A.
Author_Institution :
Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
A visual speech training aid for persons with hearing impairments has been developed using a Windows-based multimedia computer. Previous papers (Zahorian, S. et al., Int. Conf. on Spoken Language Processing, 2002; Zahorian and Nossair, Z.B., IEEE Trans. on Speech and Audio Processing, vol.7, no.4, p.414-25, 1999; Zimmer, A. et al., ICASSP, vol.6, p.3625-8, 1998; Zahorian and Jagharghi, A., J. Acoust. Soc. Amer., vol.94, no.4, p.1966-82, 1993) have describe the signal processing steps and display options for giving real-time feedback about the quality of pronunciation for 10 steady-state American English monopthong vowels (/aa/, /iy/, /uw/, /ae/, /er/, /ih/, /eh/, /ao/, /ah/, and /uh/). This vowel training aid is thus referred to as a vowel articulation training aid (VATA). We now describe methods to develop a monophone-based hidden Markov model/neural network recognizer such that real time visual feedback can be given about the quality of pronunciation of short words and phrases. Experimental results are reported which indicate a high degree of accuracy for labeling and segmenting the CVC (consonant-vowel-consonant) database developed for "training" the display.
Keywords :
computer based training; data visualisation; handicapped aids; hidden Markov models; learning (artificial intelligence); natural language interfaces; neural nets; speech processing; speech recognition; American English monopthong vowels; HMM-neural network monophone models; computer-based articulation training; hearing impaired; hearing impairments; hidden Markov model; signal processing; visual speech training aid; vowel articulation training aid; Auditory displays; Auditory system; Computer networks; Hidden Markov models; Natural languages; Neural networks; Signal processing; Speech processing; Speech recognition; Steady-state;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202373
Filename :
1202373
Link To Document :
بازگشت