Title :
A scalable feature learning and tag prediction framework for natural environment sounds
Author :
Sattigeri, P. ; Thiagarajan, J.J. ; Shah, M. ; Ramamurthy, K.N. ; Spanias, A.
Author_Institution :
SenSIP Center, Arizona State Univ., Tempe, AZ, USA
Abstract :
Building feature extraction approaches that can effectively characterize natural environment sounds is challenging due to the dynamic nature. In this paper, we develop a framework for feature extraction and obtaining semantic inferences from such data. In particular, we propose a new pooling strategy for deep architectures, that can preserve the temporal dynamics in the resulting representation. By constructing an ensemble of semantic embeddings, we employ an l1-reconstruction based prediction algorithm for estimating the relevant tags. We evaluate our approach on challenging environmental sound recognition datasets, and show that the proposed features outperform traditional spectral features.
Keywords :
acoustic signal processing; feature extraction; learning (artificial intelligence); Iι-reconstruction based prediction algorithm; environmental sound recognition; feature extraction approach; scalable feature learning; semantic inferences; tag prediction framework; Computational modeling; Computer architecture; Correlation; Dictionaries; Feature extraction; Predictive models; Semantics;
Conference_Titel :
Signals, Systems and Computers, 2014 48th Asilomar Conference on
Print_ISBN :
978-1-4799-8295-0
DOI :
10.1109/ACSSC.2014.7094773