مرکز منطقه ای اطلاع رساني علوم و فناوري - Content-Based Classification and Segmentation of Mixed-Type Audio by Using MPEG-7 Features

DocumentCode :

3632828

Title :

Content-Based Classification and Segmentation of Mixed-Type Audio by Using MPEG-7 Features

Author :

Ebru Dogan;Mustafa Sert;Adnan Yazici

Author_Institution :

Commun. Div., ASELSAN Electron. Ind., Inc., Ankara, Turkey

fYear :

2009

Firstpage :

152

Lastpage :

157

Abstract :

This paper describes the development of a generated solution for classification and segmentation of broadcast news audio. A sound stream is segmented by classifying each sub-segment into silence, pure speech, music, environmental sound, speech over music, and speech over environmental sound classes in multiple steps. Support Vector Machines and Hidden Markov Models are employed for classification and these models are trained by using different sets of MPEG-7 features. A series of tests was conducted on hand-labeled audio tracks of TRECVID broadcast news to evaluate the performance of MPEG-7 features and the selected classification methods in the proposed solution. The results obtained from our experiments clearly demonstrate that classification of mixed type audio data using Audio Spectrum Centroid, Audio Spectrum Spread, and Audio Spectrum Flatness features has considerably high accuracy rates in news domain.

Keywords :

"MPEG 7 Standard","Hidden Markov models","Broadcasting","Speech","Loudspeakers","Support vector machines","Support vector machine classification","Music","Streaming media","Testing"

Publisher :

ieee

Conference_Titel :

Advances in Multimedia, 2009. MMEDIA ´09. First International Conference on

Print_ISBN :

978-0-7695-3693-4

Type :

conf

DOI :

10.1109/MMEDIA.2009.35

Filename :

5206897

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3632828