• DocumentCode
    3662447
  • Title

    A system for semantic information extraction from mixed soundtracks deploying MARSYAS framework

  • Author

    Duraid Y. Mohammed;Philip J. Duncan;Muhammad M. Al-Maathidi;Francis F. Li

  • Author_Institution
    School of Computing, Science and Engineering, University of Salford, Salford, Greater Manchester, UK
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    1084
  • Lastpage
    1089
  • Abstract
    Ever increasing volumes of media content and the desire to extract information from media archives motivate the studies into semantic audio information mining. Much research in this filed concerns development of bespoke systems, in which sound tracks are exclusively classified and segmented, and a specific type of sound is recognized and analyzed. This approach however is detrimental to the complete extraction of all relevant semantic information and audio scene analysis. The current study addresses the issues of sound tracks with overlapped music, speech and ambient sounds, and explores how MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) can be extended to mixed and overlapped soundtrack applications. The MARSYAS has been adapted to this application by means of adopting additional speech cleaning algorithms. The proposed new system can analyze arbitrary sound tracks and timestamp the occurrence of music and speech, allowing overlaps, in the form of a “sound score” for further recognition methods to extract music score and text information. Validation tests have shown that the new system handles overlapping cases and is therefore capable of extracting more information than other existing methods.
  • Keywords
    "Speech","Music","Speech recognition","Semantics","Multiple signal classification","Speech enhancement","Noise"
  • Publisher
    ieee
  • Conference_Titel
    Industrial Informatics (INDIN), 2015 IEEE 13th International Conference on
  • ISSN
    1935-4576
  • Electronic_ISBN
    2378-363X
  • Type

    conf

  • DOI
    10.1109/INDIN.2015.7281886
  • Filename
    7281886