Regional Information Center for Science and Technology - Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge

DocumentCode :

3744876

Title :

Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge

Author :

Thanh T Vu;Benjamin Bigot;Eng Siong Chng

Author_Institution :

Rolls-Royce@NTU Corporate Lab, Nanyang Technological University, Singapore

Year :

2015

Firstpage :

423

Lastpage :

429

Abstract :

In this paper, we present our contribution to the third CHiME challenge on speech separation and recognition for noisy multi-channel recordings. The use case of the challenge consists of single-speaker utterances recorded in highly non-stationary noisy environments using a 6-microphone array mounted on a tablet computer. The front-end of our system performs speech enhancement by cascading cross-correlation-based channel selection, signal-dependent MVDR beamforming, and online source separation based on sparse NMF. The back-end module is a state-of-the-art speech recognition system with DNN acoustic models trained on fMLLR features and an RNN language model. Our system reaches an overall WER of 11.94% on real test recordings, a relative improvement of 65% over the baseline system.

Keywords :

"Speech","Noise measurement","Speech enhancement","Acoustics","Microphones","Speech recognition","Training"

Publisher :

IEEE

Conference_Title :

Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on

Type :

conf

DOI :

10.1109/ASRU.2015.7404826

Filename :

7404826

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3744876