Title :
Speech recognition in adverse environments using lip information
Author :
Thambiratnam, D. ; Wark, T. ; Sridharan, S. ; Chandran, V.
Author_Institution :
Queensland Univ. of Technol., Brisbane, Qld., Australia
Abstract :
The performance of automatic speech recognition systems deteriorates in the presence of noise. One known solution is to incorporate video information with an existing acoustic speech recognition system. We investigate the performance of the individual acoustic and visual sub-systems and then examine different ways in which the integration of the two systems may be performed. The system is to be implemented in real time on a Texas Instruments´ TMS320C80 DSP.
Keywords :
acoustic noise; image recognition; speech recognition; video signal processing; Texas Instruments TMS320C80 DSP; acoustic speech recognition system; acoustic sub-system; adverse environments; automatic speech recognition systems; lip information; noise; performance; video information; visual sub-system; Acoustic noise; Active shape model; Automatic speech recognition; Hidden Markov models; Image databases; Lips; Spatial databases; Speech enhancement; Speech processing; Speech recognition;
Conference_Titel :
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location :
Brisbane, Qld., Australia
Print_ISBN :
0-7803-4365-4
DOI :
10.1109/TENCON.1997.647279