مرکز منطقه ای اطلاع رساني علوم و فناوري - Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features

DocumentCode :

117944

Title :

Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features

Author :

Kitaoka, Norihide ; Hayashi, Tomoki ; Takeda, Kazuya

Author_Institution :

Nagoya Unviersity, Nagoya, Japan

fYear :

2014

fDate :

9-12 Dec. 2014

Firstpage :

Lastpage :

Abstract :

In this study, we investigate the effect of blind spatial subtraction arrays (BSSA) on speech recognition systems by comparing the performance of a method using Mel-Frequency Cepstral Coefficients (MFCCs) with a method using Deep Bottleneck Features (DBNF) based on Deep Neural Networks (DNN). Performance is evaluated under various conditions, including noisy, in-vehicle conditions. Although performance of the DBNF-based system was much more degraded by noise than the MFCC-based system, BSSA improved the performance of both methods greatly, especially when matched condition training of acoustic models was employed. These results show the effectiveness of BSSA for speech recognition.

Keywords :

acoustic signal processing; cepstral analysis; feature extraction; neural nets; speech recognition; BSSA; DBNF; DNN; MFCC; acoustic model training; blind spatial subtraction array technique; deep bottleneck feature; deep neural network; in-vehicle condition; mel-frequency cepstral coefficient; noisy speech recognition; Feature extraction; Hidden Markov models; Noise; Speech; Speech enhancement; Speech recognition; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)

Conference_Location :

Siem Reap

Type :

conf

DOI :

10.1109/APSIPA.2014.7041556

Filename :

7041556

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=117944