DocumentCode
173203
Title
Speech description through MINERS: Model Invariant to Noise and Environment Robust for Speech
Author
Viana, H.O. ; Mello, C.A.B.
Author_Institution
Centro de Inf., Univ. Fed. de Pernambuco, Recife, Brazil
fYear
2014
fDate
5-8 Oct. 2014
Firstpage
489
Lastpage
494
Abstract
This paper presents a new speech descriptor called MINERS. It uses image analysis, Mel-Frequency Cepstrum Coefficients (MFCC), and a combination of Wavelet denoising and Power Normalized Cepstral Coefficients (PNCC) Temporal Masking. The purpose of this descriptor is to make easier feature extraction for speech. The descriptor is invariant to noise, environment, and speaker. For evaluation, NOIZEUS database was used for speech recognition through two classifiers: Support Vector Machine (SVM) and Hidden Markov Model (HMM). MINERS has better results among all other evaluated descriptors. The most successful approach was obtained using MINERS with a SVM classifier.
Keywords
cepstral analysis; feature extraction; hidden Markov models; image classification; image denoising; speech recognition; support vector machines; HMM; MFCC; MINERS; NOIZEUS database; PNCC temporal masking; SVM classifier; feature extraction; hidden Markov model; image analysis; mel-frequency cepstrum coefficients; noise; power normalized cepstral coefficients; speech description; speech recognition; support vector machine; wavelet denoising; Hidden Markov models; Mel frequency cepstral coefficient; Noise measurement; Noise reduction; Signal to noise ratio; Speech; Image Analysis; MFCC; PNCC; Speech Descriptors; Speech Recognition; Wavelets;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics (SMC), 2014 IEEE International Conference on
Conference_Location
San Diego, CA
Type
conf
DOI
10.1109/SMC.2014.6973955
Filename
6973955
Link To Document