مرکز منطقه ای اطلاع رساني علوم و فناوري - Spectrogram based features selection using multiple kernel learning for speech/music discrimination

DocumentCode :

3144981

Title :

Spectrogram based features selection using multiple kernel learning for speech/music discrimination

Author :

Nilufar, Sharmin ; Ray, Nilanjan ; Molla, M. K Islam ; Hirose, Keikichi

Author_Institution :

Univ. of Alberta, Edmonton, AB, Canada

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

501

Lastpage :

504

Abstract :

This paper presents a multiple kernel learning (MKL) approach to speech/music discrimination (SMD). The time-frequency representation (spectrogram) implemented by short-time Fourier transform (STFT) of audio segment is decomposed by wavelet packet transform into different subband levels. The subbands, which contain rich texture information, are used as features for this discrimination problem. MKL technique is used to select the optimal subbands to discriminate the audio signals. The proposed MKL based algorithm is applied for SMD of a standard dataset. The experimental results show that the proposed technique yields noticeable improvements in classification accuracy and tolerance toward different noise types compared to the existing methods.

Keywords :

Fourier transforms; audio signal processing; feature extraction; learning (artificial intelligence); music; signal classification; speech processing; time-frequency analysis; wavelet transforms; MKL approach; SMD; STFT; audio segment decomposition; audio signals; multiple kernel learning approach; optimal subband levels; short-time Fourier transform; spectrogram based feature selection; speech-music discrimination; texture information; time-frequency representation; wavelet packet transform; Kernel; Spectrogram; Speech; Time frequency analysis; Wavelet packets; Multiple kernel learning; spectrogram; speech/music discrimination; wavelet packet transform;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6287926

Filename :

6287926

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3144981