DocumentCode :
542320
Title :
Hands-free continuous speech recognition in noise using a speaker beam-former based on spectrum-entropy
Author :
George, Nokas ; Evangelos, Dermatas
Author_Institution :
Department of Electrical & Computer Engineering, University of Patras, 26500, Hellas, Greece
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
Detection of the speaker position is a crucial task in hands-free speech recognition applications. In this paper we present a novel speech beam-former for noisy environments. Initially, the localization algorithm extracts a set of candidate directions of the signal sources using array signal processing methods in the frequency domain. Then, a minimum variance (MV) beam-former identifies the speech signal in the direction where the signal´s spectrum entropy is minimized. The proposed method is evaluated by a phoneme recognition system using noise recordings from an air-condition fan and the TIMIT speech corpus. Extended experiments, carried out in the range of 25–0 dB, show almost perfect estimation of the speaker DOA in all cases. As a consequence, the recognition rate increases significantly compared to the rate obtained by a single microphone. The recognition improvement increases especially in very low SNRs.
Keywords :
Arrays; Entropy; Hidden Markov models; Robustness; Speech; Speech recognition; Three dimensional displays;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743882
Filename :
5743882
Link To Document :
بازگشت