DocumentCode :
1037616
Title :
Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection
Author :
Dubnov, Shlomo
Author_Institution :
Univ. of California at San Diego, La Jolla, CA
Volume :
16
Issue :
2
fYear :
2008
Firstpage :
327
Lastpage :
337
Abstract :
In this paper, we present a new method for analysis of musical structure that captures local prediction and global repetition properties of audio signals in one information processing framework. The method is motivated by a recent work in music perception where machine features were shown to correspond to human judgments of familiarity and emotional force when listening to music. Using a notion of information rate in a model-based framework, we develop a measure of mutual information between past and present in a time signal and show that it consist of two factors - prediction property related to data statistics within an individual block of signal features, and repetition property based on differences in model likelihood across blocks. The first factor, when applied to spectral representation of audio signals, is known as spectral anticipation, and the second factor is known as recurrence analysis. We present algorithms for estimation of these measures and create a visualization that displays their temporal structure in musical recordings. Considering these features as a measure of the amount of information processing that a listening system performs on a signal, information rate is used to detect interest points in music. Several musical works with different performances are analyzed in this paper, and their structure and interest points are displayed and discussed. Extensions of this approach towards a general framework of characterizing machine listening experience are suggested.
Keywords :
audio signal processing; musical acoustics; pattern clustering; audio signals; information processing; interest point detection; music perception; recurrence analysis; repetition structure; spectral anticipation; Humans; Information analysis; Information processing; Information rates; Mutual information; Predictive models; Signal analysis; Signal processing; Statistics; Time measurement; Information rate; interest points; musical structure; recurrence matrix; spectral anticipation; spectral clustering; visualization;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.912378
Filename :
4432637
Link To Document :
بازگشت