مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust sound event recognition using convolutional neural networks

DocumentCode :

3414500

Title :

Robust sound event recognition using convolutional neural networks

Author :

Haomin Zhang ; McLoughlin, Ian ; Yan Song

Author_Institution :

Nat. Eng. Lab. of Speech & Language Inf. Process., Univ. of Sci. & Technol. of China, Hefei, China

fYear :

2015

fDate :

19-24 April 2015

Firstpage :

559

Lastpage :

563

Abstract :

Traditional sound event recognition methods based on informative front end features such as MFCC, with back end sequencing methods such as HMM, tend to perform poorly in the presence of interfering acoustic noise. Since noise corruption may be unavoidable in practical situations, it is important to develop more robust features and classifiers. Recent advances in this field use powerful machine learning techniques with high dimensional input features such as spectrograms or auditory image. These improve robustness largely thanks to the discriminative capabilities of the back end classifiers. We extend this further by proposing novel features derived from spectrogram energy triggering, allied with the powerful classification capabilities of a convolutional neural network (CNN). The proposed method demonstrates excellent performance under noise-corrupted conditions when compared against state-of-the-art approaches on standard evaluation tasks. To the author´s knowledge this in the first application of CNN in this field.

Keywords :

neural nets; speech recognition; ASR; CNN; HMM; MFCC; auditory image; automatic speech recognition; convolutional neural networks; informative front end features; interfering acoustic noise; machine learning techniques; noise corruption; robust sound event recognition; spectrogram image; Accuracy; Noise; Robustness; Smoothing methods; Spectrogram; Speech; Speech recognition; Machine hearing; auditory event detection; convolutional neural networks;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location :

South Brisbane, QLD

Type :

conf

DOI :

10.1109/ICASSP.2015.7178031

Filename :

7178031

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3414500