DocumentCode :
2155459
Title :
Robust speech recognition in a high interference real room environment using blind speech extraction
Author :
Koutras, A. ; Dermatas, E.
Author_Institution :
Electr. & Comput. Eng. Dept., Patras Univ., Greece
Volume :
1
fYear :
2002
fDate :
2002
Firstpage :
167
Abstract :
We present a novel blind signal extraction (BSE) method for robust speech recognition in a real room environment under the coexistence of simultaneous interfering non-speech sources. The proposed method is capable of extracting the target speaker's voice based on a maximum kurtosis criterion. Extensive phoneme recognition experiments have proved the proposed network's efficacy when used in a real-life situation of a talking speaker with the coexistence of various non-speech sources (e.g. music and noise), achieving a phoneme recognition improvement of about 23%, especially under high interference. Furthermore, comparison of the proposed network to known blind source separation networks, commonly used in similar situations, showed lower computational complexity and better recognition accuracy of the BSE network, making it ideal to be used as a front-end to existing ASR systems.
Keywords :
acoustic noise; blind source separation; computational complexity; speech recognition; statistical analysis; ASR; acoustic interference; automatic speech recognition; blind signal extraction; blind source separation; blind speech extraction; cocktail party effect; computational complexity; high interference real room environment; maximum kurtosis criterion; statistical analysis; Automatic speech recognition; Blind source separation; Data mining; Delay; Interference; Loudspeakers; Microphones; Robustness; Source separation; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
Print_ISBN :
0-7803-7503-3
Type :
conf
DOI :
10.1109/ICDSP.2002.1027867
Filename :
1027867