Title :
Research on voice activity detection in burst and partial duration noisy environment
Author :
Guo, Chunyi ; Li, Runzhi ; Fan, Ming ; Liu, Kejun
Author_Institution :
Sch. of Inf. Eng., Zhengzhou Univ., Zhengzhou, China
Abstract :
Voice activity detection aims at detecting speech in noisy environment and is very important for speech recognition. In this paper, two novel methods are proposed for finding out burst and partial duration noisy signals in order to detect real speech from non-stationary noise and to improve the performance of continuous Mandarin speech recognition system. For the burst and low-energy noise adjacent to speech segment, a method using initial and final part of Chinese syllable is applied to detect accurate endpoints of speech based on traditional short-time energy and zero-crossing rates. For the isolated noise with relatively high energy, a short-time energy zero product based method is used. Both of the methods use time-domain features and have low computational complexity. Experimental results show that the system using proposed methods has improved accuracy in voice activity detection.
Keywords :
computational complexity; feature extraction; natural language processing; signal detection; speech recognition; time-domain analysis; Chinese syllable; burst duration noisy environment; computational complexity; continuous Mandarin speech recognition system; nonstationary noise; partial duration noisy environment; short-time energy zero product based method; speech endpoint detection; time-domain features; voice activity detection; zero-crossing rates; Hidden Markov models; Noise measurement; Signal to noise ratio; Speech; Speech processing; Speech recognition;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376759