• DocumentCode
    357054
  • Title

    Source segmentation for structured audio

  • Author

    Melih, Kathy ; Gonzalez, Ruben

  • Author_Institution
    Sch. of Inf. Technol., Griffith Univ., Gold Coast, Qld., Australia
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    811
  • Abstract
    The increasing demand for content-based manipulation of ever-growing stores of audio data, together with the emergence of MPEG-7, has created a need for structured audio representations. However, while the necessity of such a representation has been recognised and, to some extent, its essential features have been identified, its actual development and implementation have generally been left as problems for another time or person to solve. This paper attempts to address this shortfall by defining an audio structure that allows content-based manipulation of audio at the level of audio objects. The paper then summarises the processes required to generate such a structure. Further, it details how the second level of this structure can be derived from a low-level, perceptually based audio representation previously developed by the authors to satisfy the requirements at the lowest level of the audio structure. Finally, initial experimental results are presented.
  • Keywords
    audio signal processing; MPEG-7; audio objects; content based manipulation; low-level perceptually based audio representation; source segmentation; structured audio representations; Auditory system; Content based retrieval; Data mining; Feature extraction; Gold; Information technology; MPEG 7 Standard; Music information retrieval; Speech; Streaming media;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on
  • Conference_Location
    New York, NY
  • Print_ISBN
    0-7803-6536-4
  • Type

    conf

  • DOI
    10.1109/ICME.2000.871484
  • Filename
    871484