DocumentCode :
1851016
Title :
1-D Local binary patterns based VAD used INHMM-based improved speech recognition
Author :
Zhu, Qiming ; Chatlani, Navin ; Soraghan, John J.
Author_Institution :
Centre for Excellence in Signal & Image Process. (CeSIP), Univ. of Strathclyde, Glasgow, UK
fYear :
2012
fDate :
27-31 Aug. 2012
Firstpage :
1633
Lastpage :
1637
Abstract :
In this paper, 1-D Local binary patterns (LBP) are proposed to be used in speech signal segmentation and voice activation detection (VAD)and combined with hidden Markov model (HMM) for advanced speech recognition. Speech is firstly de-noised by Adaptive Empirical Model Decomposition (AEMD), and then processed using LBP based VAD. The short-time energy of the speech activity detected from the VAD is finally smoothed and used as the input of the HMM recognition process. The enhanced performance of the proposed system for speech recognition is compared with other VAD techniques at different SNRs ranging from 15 dB to a robust noisy condition at -5 dB.
Keywords :
hidden Markov models; signal denoising; speech recognition; 1-D LBP; 1D local binary pattern; AEMD; INHMM-based improved speech recognition; SNR; VAD technique; adaptive empirical model decomposition; hidden Markov model; noise figure -5 dB; noise figure 15 dB; speech activity detection; speech denoising; speech signal segmentation; voice activation detection; Hidden Markov models; Noise; Noise measurement; Speech; Speech enhancement; Speech recognition; Hidden Markov Model; Local Binary Patterns; Noise Reduction; Speech Enhancement using Adaptive Empirical Model Decomposition (AEMD); Voice Activity Detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location :
Bucharest
ISSN :
2219-5491
Print_ISBN :
978-1-4673-1068-0
Type :
conf
Filename :
6334020
Link To Document :
بازگشت