Title :
Learning optimal features for music transcription
Author :
Huaiping Ming ; Dongyan Huang ; Lei Xie ; Haizhou Li
Author_Institution :
Sch. of Comput. Sci., Northwestern Polytech. Univ., Xi´an, China
Abstract :
This paper aims to design time-frequency representation (TFR) functions for automatic music transcription. It is desirable that the decomposition of those TFR functions are suitable for notes having variation of both pitch and spectral envelop over time. The Harmonic Adaptive Latent Component Analysis (HALCA) model adopted in this paper allows considering those two kinds of variations simultaneously. We evaluate the influence of three TFR functions including IIR, FIR filter bank semigram (FBSG) and constant-Q transform semigram in automatic music transcription task, on a database of popular and polyphonic classic music. The experiment results show that the filter bank based representations are suitable for multiple-instrument recordings and a CQT-based representation turns out to provide very accurate transcription for solo-instrument recordings.
Keywords :
FIR filters; channel bank filters; harmonic analysis; learning (artificial intelligence); music; signal representation; time-frequency analysis; transforms; CQT-based representation; FBSG; FIR filter bank semigram; HALCA model; TFR functions; automatic music transcription; constant-Q transform semigram; filter bank based representations; harmonic adaptive latent component analysis; multiple-instrument recordings; optimal feature learning; pitch envelop; polyphonic classic music; solo-instrument recordings; spectral envelop; time-frequency representation function; Estimation; Feature extraction; Finite impulse response filters; Frequency estimation; Instruments; Speech; Speech processing; Semigram features; constant-Q transform; filter bank; logarithmic compression; music transcription;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-5401-8
DOI :
10.1109/ChinaSIP.2014.6889211