DocumentCode :
3688616
Title :
Environmental sound classification with convolutional neural networks
Author :
Karol J. Piczak
Author_Institution :
Institute of Electronic Systems, Warsaw University of Technology
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
This paper evaluates the potential of convolutional neural networks in classifying short audio clips of environmental sounds. A deep model consisting of 2 convolutional layers with max-pooling and 2 fully connected layers is trained on a low level representation of audio data (segmented spectrograms) with deltas. The accuracy of the network is evaluated on 3 public datasets of environmental and urban recordings. The model outperforms baseline implementations relying on mel-frequency cepstral coefficients and achieves results comparable to other state-of-the-art approaches.
Keywords :
"Neural networks","Training","Accuracy","Convolution","Convolutional codes","Yttrium","Pattern recognition"
Publisher :
ieee
Conference_Titel :
Machine Learning for Signal Processing (MLSP), 2015 IEEE 25th International Workshop on
Type :
conf
DOI :
10.1109/MLSP.2015.7324337
Filename :
7324337
Link To Document :
بازگشت