DocumentCode
463737
Title
Analysis of Audio Clustering using Word Descriptions
Author
Sundaram, Suresh ; Narayanan, Shrikanth
Author_Institution
Dept. of Electr. Eng. Syst., Univ. of Southern California, Los Angeles, CA, USA
Volume
2
fYear
2007
fDate
15-20 April 2007
Abstract
We present an analysis of clustering audio clips using word descriptions that are imitative of sounds. These onomatopoeia words describe the acoustic properties of sources, and they can be useful in annotating a medium that cannot embed audio (e.g. text). First, an audio-to-word relationship is established by manually tagging a variety of audio clips (from a sound effects library) with onomatopoeia words. Using a newly proposed distance metric for word-level similarities, the feature vectors from the audio are clustered according to their tags, resulting in clusters with similarities in their onomatopoeic descriptions. By discriminant analysis of the clusters at the feature level, we present results on separability of these clusters. Our results indicate that by just using onomatopoeic descriptions, meaningful clusters with similar acoustic properties can be formed. However, in terms of audio feature level representation, clusters formed by some word groups such as buzz, fizz etc are better represented by signal features than percussive sounds such as clang, clank, tap.
Keywords
audio signal processing; acoustic properties; audio clustering analysis; audio feature level representation; audio-to-word relationship; discriminant analysis; onomatopoeic descriptions; word descriptions; word-level similarities; Acoustic testing; Extraterrestrial measurements; Information retrieval; Labeling; Laboratories; Libraries; Nails; Ontologies; Pattern analysis; Speech analysis; analysis of audio clusters; audio information retrieval; audio ontology; onomatopoeia based audio descriptions;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366349
Filename
4217522
Link To Document