Title :
Structured sparsity for automatic music transcription
Author :
O´Hanlon, Ken ; Nagano, Hidehisa ; Plumbley, Mark D.
Author_Institution :
Centre for Digital Music, Queen Mary Univ. of London, UK
Abstract :
Sparse representations have previously been applied to the automatic music transcription (AMT) problem. Structured sparsity, such as group and molecular sparsity allows the introduction of prior knowledge to sparse representations. Molecular sparsity has previously been proposed for AMT, however the use of greedy group sparsity has not previously been proposed for this problem. We propose a greedy sparse pursuit based on nearest subspace classification for groups with coherent blocks, based in a non-negative framework, and apply this to AMT. Further to this, we propose an enhanced molecular variant of this group sparse algorithm and demonstrate the effectiveness of this approach.
Keywords :
greedy algorithms; signal representation; automatic music transcription; coherent blocks; greedy group sparsity; group sparse algorithm; nonnegative framework; sparse representations; structured sparsity; subspace classification; Approximation algorithms; Approximation methods; Artificial neural networks; Dictionaries; Encoding; Matching pursuit algorithms; Measurement; Transcription; non-negative; structured sparsity;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6287911