DocumentCode :
117944
Title :
Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features
Author :
Kitaoka, Norihide ; Hayashi, Tomoki ; Takeda, Kazuya
Author_Institution :
Nagoya Unviersity, Nagoya, Japan
fYear :
2014
fDate :
9-12 Dec. 2014
Firstpage :
1
Lastpage :
5
Abstract :
In this study, we investigate the effect of blind spatial subtraction arrays (BSSA) on speech recognition systems by comparing the performance of a method using Mel-Frequency Cepstral Coefficients (MFCCs) with a method using Deep Bottleneck Features (DBNF) based on Deep Neural Networks (DNN). Performance is evaluated under various conditions, including noisy, in-vehicle conditions. Although performance of the DBNF-based system was much more degraded by noise than the MFCC-based system, BSSA improved the performance of both methods greatly, especially when matched condition training of acoustic models was employed. These results show the effectiveness of BSSA for speech recognition.
Keywords :
acoustic signal processing; cepstral analysis; feature extraction; neural nets; speech recognition; BSSA; DBNF; DNN; MFCC; acoustic model training; blind spatial subtraction array technique; deep bottleneck feature; deep neural network; in-vehicle condition; mel-frequency cepstral coefficient; noisy speech recognition; Feature extraction; Hidden Markov models; Noise; Speech; Speech enhancement; Speech recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
Conference_Location :
Siem Reap
Type :
conf
DOI :
10.1109/APSIPA.2014.7041556
Filename :
7041556
Link To Document :
بازگشت