DocumentCode
763406
Title
Sparse and shift-invariant representations of music
Author
Blumensath, Thomas ; Davies, Mike
Author_Institution
Digital Signal Processing Group, University of London, UK
Volume
14
Issue
1
fYear
2006
Firstpage
50
Lastpage
57
Abstract
Redundancy reduction has been proposed as the main computational process in the primary sensory pathways in the mammalian brain. This idea has led to the development of sparse coding techniques, which are exploited in this article to extract salient structure from musical signals. In particular, we use a sparse coding formulation within a generative model that explicitly enforces shift-invariance. Previous work has applied these methods to relatively small problem sizes. In this paper, we present a subset selection step to reduce the computational complexity of these methods, which then enables us to use the sparse coding approach for many real world applications. We demonstrate the algorithm's potential on two tasks in music analysis: the extraction of individual notes from polyphonic piano music and single-channel blind source separation.
Keywords
acoustic signal processing; blind source separation; codes; computational complexity; feature extraction; music; signal representation; mammalian brain; musical signal; polyphonic piano music; primary sensory pathways; redundancy reduction; salient structure; shift-invariant music representations; single-channel blind source separation; sparse coding techniques; sparse music representations; algorithm design and analysis; dictionaries; humans; independent component analysis (ICA); multiple signal classification; signal processing algorithms; source separation; unsupervised learning; shift-invariance; sparse coding; time-series analysis
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TSA.2005.860346
Filename
1561263