• DocumentCode
    463737
  • Title

    Analysis of Audio Clustering using Word Descriptions

  • Author

    Sundaram, Suresh ; Narayanan, Shrikanth

  • Author_Institution
    Dept. of Electr. Eng. Syst., Univ. of Southern California, Los Angeles, CA, USA
  • Volume
    2
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    We present an analysis of clustering audio clips using word descriptions that are imitative of sounds. These onomatopoeia words describe the acoustic properties of sources, and they can be useful in annotating a medium that cannot embed audio (e.g. text). First, an audio-to-word relationship is established by manually tagging a variety of audio clips (from a sound effects library) with onomatopoeia words. Using a newly proposed distance metric for word-level similarities, the feature vectors from the audio are clustered according to their tags, resulting in clusters with similarities in their onomatopoeic descriptions. By discriminant analysis of the clusters at the feature level, we present results on separability of these clusters. Our results indicate that by just using onomatopoeic descriptions, meaningful clusters with similar acoustic properties can be formed. However, in terms of audio feature level representation, clusters formed by some word groups such as buzz, fizz etc are better represented by signal features than percussive sounds such as clang, clank, tap.
  • Keywords
    audio signal processing; acoustic properties; audio clustering analysis; audio feature level representation; audio-to-word relationship; discriminant analysis; onomatopoeic descriptions; word descriptions; word-level similarities; Acoustic testing; Extraterrestrial measurements; Information retrieval; Labeling; Laboratories; Libraries; Nails; Ontologies; Pattern analysis; Speech analysis; analysis of audio clusters; audio information retrieval; audio ontology; onomatopoeia based audio descriptions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366349
  • Filename
    4217522