مرکز منطقه ای اطلاع رساني علوم و فناوري - Mixed Type Audio Classification with Support Vector Machine

DocumentCode :

3623836

Title :

Mixed Type Audio Classification with Support Vector Machine

Author :

Lei Chen;Sule Gunduz;M. Tamer Ozsu

Author_Institution :

Department of Computer Science, Hong Kong University of Sci. and Tech., leichen@ust.hk

fYear :

2006

fDate :

7/1/2006 12:00:00 AM

Firstpage :

781

Lastpage :

784

Abstract :

Content-based classification of audio data is an important problem for various applications such as overall analysis of audio-visual streams, boundary detection of video story segment, extraction of speech segments from video, and content-based video retrieval. Though the classification of audio into single type such as music, speech, environmental sound and silence is well studied, classification of mixed type audio data, such as clips having speech with music as background, is still considered a difficult problem. In this paper, we present a mixed type audio classification system based on support vector machine (SVM). In order to capture characteristics of different types of audio data, besides selecting audio features, we also design four different representation formats for each feature. Our SVM-based audio classifier can classify audio data into five types: music, speech, environment sound, speech mixed with music, and music mixed with environment sound. The experimental results show that our system outperforms other classification systems using k nearest neighbor (k-NN), neural network (NN), and Naive Bayes (NB)

Keywords :

"Support vector machines","Support vector machine classification","Speech analysis","Neural networks","Streaming media","Data mining","Music information retrieval","Content based retrieval","Nearest neighbor searches","Niobium"

Publisher :

ieee

Conference_Titel :

Multimedia and Expo, 2006 IEEE International Conference on

ISSN :

1945-7871

Print_ISBN :

1-4244-0366-7

Electronic_ISBN :

1945-788X

Type :

conf

DOI :

10.1109/ICME.2006.262954

Filename :

4036716

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3623836