مرکز منطقه ای اطلاع رساني علوم و فناوري - A system for semantic information extraction from mixed soundtracks deploying MARSYAS framework

DocumentCode :

3662447

Title :

A system for semantic information extraction from mixed soundtracks deploying MARSYAS framework

Author :

Duraid Y. Mohammed;Philip J. Duncan;Muhammad M. Al-Maathidi;Francis F. Li

Author_Institution :

School of Computing, Science and Engineering, University of Salford, Salford, Greater Manchester, UK

fYear :

2015

fDate :

7/1/2015 12:00:00 AM

Firstpage :

1084

Lastpage :

1089

Abstract :

Ever increasing volumes of media content and the desire to extract information from media archives motivate the studies into semantic audio information mining. Much research in this filed concerns development of bespoke systems, in which sound tracks are exclusively classified and segmented, and a specific type of sound is recognized and analyzed. This approach however is detrimental to the complete extraction of all relevant semantic information and audio scene analysis. The current study addresses the issues of sound tracks with overlapped music, speech and ambient sounds, and explores how MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) can be extended to mixed and overlapped soundtrack applications. The MARSYAS has been adapted to this application by means of adopting additional speech cleaning algorithms. The proposed new system can analyze arbitrary sound tracks and timestamp the occurrence of music and speech, allowing overlaps, in the form of a “sound score” for further recognition methods to extract music score and text information. Validation tests have shown that the new system handles overlapping cases and is therefore capable of extracting more information than other existing methods.

Keywords :

"Speech","Music","Speech recognition","Semantics","Multiple signal classification","Speech enhancement","Noise"

Publisher :

ieee

Conference_Titel :

Industrial Informatics (INDIN), 2015 IEEE 13th International Conference on

ISSN :

1935-4576

Electronic_ISBN :

2378-363X

Type :

conf

DOI :

10.1109/INDIN.2015.7281886

Filename :

7281886

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3662447