DocumentCode :
2954938
Title :
Generating Expressive Summaries for Speech and Musical Audio using Self-Similarity Clues
Author :
Sert, Mustafa ; Baykal, Buyurman ; Yazici, Adnan
Author_Institution :
Dept. of Comput. Eng., Baskent Univ., Ankara
fYear :
2006
fDate :
9-12 July 2006
Firstpage :
941
Lastpage :
944
Abstract :
We present a novel algorithm for structural analysis of audio to detect repetitive patterns that are suitable for content-based audio information retrieval systems, since repetitive patterns can provide valuable information about the content of audio, such as a chorus or a concept. The audio spectrum flatness (ASF) feature of the MPEG-7 standard, although not having been considered as much as other feature types, has been utilized and evaluated as the underlying feature set. Expressive summaries are chosen as the longest patterns by the k-means clustering algorithm. Proposed approach is evaluated on a test bed consisting of popular song and speech clips based on the ASF feature. The well known Mel frequency cepstral coefficients (MFCCs) are also considered in the experiments for the evaluation of features. Experiments show that, all the repetitive patterns and their locations are obtained with the accuracy of 93% and 78% for music and speech, respectively
Keywords :
acoustic signal processing; audio signal processing; audio systems; cepstral analysis; content-based retrieval; data compression; fractals; music; pattern clustering; speech; ASF; MFCC; MPEG-7 standard; Mel frequency cepstral coefficient; audio spectrum flatness feature; audio structural analysis; content-based audio information retrieval system; k-means clustering algorithm; music; pattern detection; self-similarity clue; speech clip; Algorithm design and analysis; Clustering algorithms; Content based retrieval; Information analysis; MPEG 7 Standard; Mel frequency cepstral coefficient; Music information retrieval; Pattern analysis; Speech analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
Type :
conf
DOI :
10.1109/ICME.2006.262675
Filename :
4036756
Link To Document :
بازگشت