DocumentCode :
1485692
Title :
Histogram Equalization-Based Features for Speech, Music, and Song Discrimination
Author :
Gallardo-Antolín, Ascensión ; Montero, Juan M.
Author_Institution :
Dept. of Signal Theor. & Commun., Univ. Carlos III de Madrid, Leganes, Spain
Volume :
17
Issue :
7
fYear :
2010
fDate :
7/1/2010 12:00:00 AM
Firstpage :
659
Lastpage :
662
Abstract :
In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.
Keywords :
audio signal processing; speech processing; PHEQ; feature distribution; mel frequency cepstrum coefficients; music discrimination; polynomial fit histogram equalization; song discrimination; speech discrimination; Acoustic features; HEQ-based features; audio classification; parameterization; speech/music/song discrimination;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2010.2049877
Filename :
5460954
Link To Document :
بازگشت