مرکز منطقه ای اطلاع رساني علوم و فناوري - Combining evidence from subsegmental and segmental features for audio clip classification

DocumentCode :

2529854

Title :

Combining evidence from subsegmental and segmental features for audio clip classification

Author :

Bajpai, Anvita ; Yegnanarayana, B.

Author_Institution :

DeciDyn Syst., Bangalore

fYear :

2008

fDate :

19-21 Nov. 2008

Firstpage :

Lastpage :

Abstract :

In this paper, we demonstrate the complementary nature of audio-specific excitation source (subsegmental) information present in the linear prediction (LP) residual, to the information derived using spectral (segmental) features, for audio clip classification. Classes considered for study are advertisement, cricket, cartoon, football and news, and the data is collected from TV broadcast with large intra-class variability. A baseline system based on segmental features and hidden Markov models (HMM), gives classification accuracy of 62.08%. Another baseline system, based on subsegmental features present in the LP residual, built using autoassociative neural networks (AANN) to model audio components, and multilayer perceptron (MLP) to classify audio, gives classification accuracy of 52.72%. The two systems are combined at abstract level and give classification accuracy of 86.96%, indicating their complementary nature. The rank and measurement level combination of the two systems is further used to enhance the classification accuracy to 92.97%.

Keywords :

audio signal processing; hidden Markov models; signal classification; audio clip classification; audio components; audio-specific excitation source; autoassociative neural networks; hidden Markov models; linear prediction residual; multilayer perceptron; spectral features; subsegmental features; Bandwidth; Hidden Markov models; Indexing; Information technology; Multi-layer neural network; Multilayer perceptrons; Music information retrieval; Neural networks; Speech recognition; TV broadcasting; Audio indexing; Combining classifiers; Linear prediction residual; Neural networks; Pattern classification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

TENCON 2008 - 2008 IEEE Region 10 Conference

Conference_Location :

Hyderabad

Print_ISBN :

978-1-4244-2408-5

Electronic_ISBN :

978-1-4244-2409-2

Type :

conf

DOI :

10.1109/TENCON.2008.4766692

Filename :

4766692

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2529854