DocumentCode
3334687
Title
A voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin
Author
Yinfeng Wang ; Shaoguang Huang ; Ying Wei
Author_Institution
Sch. of Inf. Sci. & Eng., Shandong Univ., Jinan, China
Volume
03
fYear
2013
fDate
16-18 Dec. 2013
Firstpage
1287
Lastpage
1291
Abstract
Voice activity detection algorithms are widely used in the areas of voice compression, speech synthesis, speech recognition, speech enhancement, and etc. In this paper, an efficient voice activity detection algorithm with sub-band detection based on time-frequency characteristics of mandarin is proposed. The proposed sub-band detection consists of two parts: crosswise detection and lengthwise detection. Energy detection and pitch detection are in the range of considerations. For a better performance, double-threshold criterion is used to reduce the misjudgment rate of the detection. Performance evaluation is based on six noise environments with different SNRs. Experiment results indicate that the proposed algorithm can detect the area of voice effectively in non-stationary environment and low SNR environment and has the potential to progress.
Keywords
speech enhancement; speech recognition; speech synthesis; time-frequency analysis; Mandarin; crosswise detection; double-threshold criterion; energy detection; lengthwise detection; pitch detection; speech enhancement; speech recognition; speech synthesis; time-frequency characteristics; voice activity detection algorithm; voice compression; Accuracy; Acoustics; Filter banks; Signal to noise ratio; Speech; Time-frequency analysis; VAD; mandarin; pitch; sub-band detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Image and Signal Processing (CISP), 2013 6th International Congress on
Conference_Location
Hangzhou
Print_ISBN
978-1-4799-2763-0
Type
conf
DOI
10.1109/CISP.2013.6743871
Filename
6743871
Link To Document