DocumentCode
1389177
Title
Auditory Spectrum-Based Pitched Instrument Onset Detection
Author
Benetos, Emmanouil ; Stylianou, Yannis
Author_Institution
Inst. of Comput. Sci., Found. for Res. & Technol.-Hellas (FORTH), Heraklion, Greece
Volume
18
Issue
8
fYear
2010
Firstpage
1968
Lastpage
1977
Abstract
In this paper, a method for onset detection of music signals using auditory spectra is proposed. The auditory spectrogram provides a time-frequency representation that employs a sound processing model resembling the human auditory system. Recent work on onset detection employs DFT-based features describing spectral energy and phase differences, as well as pitch-based features. These features are often combined for maximizing detection performance. Here, the spectral flux and phase slope features are derived in the auditory framework and a novel fundamental frequency estimation algorithm based on auditory spectra is introduced. An onset detection algorithm is proposed, which processes and combines the aforementioned features at the decision level. Experiments are conducted on a dataset covering 11 pitched instrument types, consisting of 1829 onsets in total. Results indicate that auditory representations outperform various state-of-the-art approaches, with the onset detection algorithm reaching an F-measure of 82.6%.
Keywords
audio signal processing; discrete Fourier transforms; music; DFT-based features; auditory spectra; auditory spectrum-based pitched instrument; music signals; onset detection; time-frequency representation; Auditory system; Computer vision; Detection algorithms; Humans; Instruments; Multiple signal classification; Phase detection; Signal detection; Spectrogram; Time frequency analysis; Auditory spectrum; group delay function; onset detection;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2040785
Filename
5393059
Link To Document