Title :
A Neural Network based Audio Content Classification
Author :
Mitra, Vikramjit ; Wang, Chia J.
Author_Institution :
Univ. of Maryland, Greenbelt
Abstract :
The emergence of digital music in the Internet calls for a reliable real-time tool to analyze and properly categorize them for the users. To incorporate content or genre queries in Web searches, audio content analysis and classification is imperative. This paper proposes a set of audio content features and a parallel neural network architecture that addresses the task of automated content based audio classification. Feature sets based on signal periodicity, beat information, sub-band energy, mel-frequency cepstral coefficients and wavelet transforms are proposed and each of the feature sets are individually analyzed for their pertinence in the proposed task. A parallel multi-layered perceptron network is proposed which offers a classification accuracy of 84.4% to distinguish between 6 different genres. The proposed architecture is compared with a support vector machine based classifier and is found to perform superiorly than the later.
Keywords :
Internet; audio systems; classification; multilayer perceptrons; multimedia computing; music; support vector machines; wavelet transforms; Internet calls; Web searches; audio content analysis; audio content classification; beat information; digital music; mel-frequency cepstral coefficients; multilayered perceptron network; neural network; signal periodicity; sub-band energy; support vector machine; wavelet transforms; Cepstral analysis; Information analysis; Internet; Multilayer perceptrons; Neural networks; Signal analysis; Support vector machines; Wavelet analysis; Wavelet transforms; Web search;
Conference_Titel :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-1379-9
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2007.4371179