Title :
Histogram Equalization-Based Features for Speech, Music, and Song Discrimination
Author :
Gallardo-Antolín, Ascensión ; Montero, Juan M.
Author_Institution :
Dept. of Signal Theor. & Commun., Univ. Carlos III de Madrid, Leganes, Spain
fDate :
7/1/2010 12:00:00 AM
Abstract :
In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.
Keywords :
audio signal processing; speech processing; PHEQ; feature distribution; mel frequency cepstrum coefficients; music discrimination; polynomial fit histogram equalization; song discrimination; speech discrimination; Acoustic features; HEQ-based features; audio classification; parameterization; speech/music/song discrimination;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2010.2049877