DocumentCode :
3662447
Title :
A system for semantic information extraction from mixed soundtracks deploying MARSYAS framework
Author :
Duraid Y. Mohammed;Philip J. Duncan;Muhammad M. Al-Maathidi;Francis F. Li
Author_Institution :
School of Computing, Science and Engineering, University of Salford, Salford, Greater Manchester, UK
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
1084
Lastpage :
1089
Abstract :
Ever increasing volumes of media content and the desire to extract information from media archives motivate the studies into semantic audio information mining. Much research in this filed concerns development of bespoke systems, in which sound tracks are exclusively classified and segmented, and a specific type of sound is recognized and analyzed. This approach however is detrimental to the complete extraction of all relevant semantic information and audio scene analysis. The current study addresses the issues of sound tracks with overlapped music, speech and ambient sounds, and explores how MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) can be extended to mixed and overlapped soundtrack applications. The MARSYAS has been adapted to this application by means of adopting additional speech cleaning algorithms. The proposed new system can analyze arbitrary sound tracks and timestamp the occurrence of music and speech, allowing overlaps, in the form of a “sound score” for further recognition methods to extract music score and text information. Validation tests have shown that the new system handles overlapping cases and is therefore capable of extracting more information than other existing methods.
Keywords :
"Speech","Music","Speech recognition","Semantics","Multiple signal classification","Speech enhancement","Noise"
Publisher :
ieee
Conference_Titel :
Industrial Informatics (INDIN), 2015 IEEE 13th International Conference on
ISSN :
1935-4576
Electronic_ISBN :
2378-363X
Type :
conf
DOI :
10.1109/INDIN.2015.7281886
Filename :
7281886
Link To Document :
بازگشت