Title :
Distant speech recognition in reverberant noisy conditions employing a microphone array
Author :
Morales-Cordovilla, Juan A. ; Hagmuller, M. ; Pessentheiner, Hannes ; Kubin, Gernot
Author_Institution :
Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz, Austria
Abstract :
This paper addresses the problem of distant speech recognition in reverberant noisy conditions employing a microphone array. We present a prototype system that can segment the utterances in real-time and generate robust ASR results off-line. The segmentation is carried out by a voice activity detector based on deep belief networks, the speaker localization by a position-pitch plane, and the enhancement by a novel combination of convex optimized beamforming and vector Taylor series compensation. All of the components are compared with other similar ones and justified in terms of word accuracy on a proposed database which simulates distant speech recognition in a home environment.
Keywords :
array signal processing; belief networks; convex programming; microphone arrays; signal detection; speaker recognition; speech enhancement; convex optimized beamforming; deep belief networks; distant speech recognition; home environment; microphone array; position-pitch plane; reverberant noisy conditions; robust ASR; speaker localization; speech enhancement; vector Taylor series compensation; voice activity detector; Accuracy; Arrays; Microphones; Noise; Speech; Speech recognition; Vectors; German database; PoPi speaker localization; convexoptimized beamforming; deep belief network voice activity detection; distant speech recognition; natural mixing; reverberant and noisy environment; vector Taylor series compensation;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location :
Lisbon