Title :
Perceptually motivated quasi-periodic signal selection for polyphonic music transcription
Author :
Triki, Mahdi ; Slock, Dirk T M
Author_Institution :
Philips Res. Labs., Eindhoven
Abstract :
A multiple fundamental frequency estimator is a key building block in music transcription and indexing operations. However, systems trying to perform this task tend to be very complex. Indeed, music transcription requires an analysis accounting for both physical and psycho-acoustical matters. In this work, we propose a physically-motivated audio signal analysis followed by an auditory-based selection. The audio signal model allows for a better time/frequency resolution tradeoff, while the auditory distance discards the redundant/non-relevant information. No prior information on the musical instrument, musical genre, and/or maximum polyphony are needed. Simulations show that the proposed technique achieves good transcription results for a variety of string and wind instruments. The proposed scheme is also shown to be robust in the presence of noise, percussive sounds and in unbalanced signal-to-interference ratio (SIR) situations.
Keywords :
acoustic signal processing; audio signal processing; audio signal analysis; auditory-based selection; indexing operations; multiple fundamental frequency estimator; perceptually motivated quasi-periodic signal selection; polyphonic music transcription; unbalanced signal-to-interference ratio situations; Acoustic noise; Frequency estimation; Indexing; Instruments; Multiple signal classification; Noise robustness; Psychoacoustic models; Psychology; Signal analysis; Signal resolution; amplitude modulation; frequency-selective; music transcription; perceptual model; pitch recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959581