Title :
A semi-continuous state transition probability HMM-based voice activity detection
Author :
Othman, H. ; Abounasr, T.
Author_Institution :
Sch. of Inf. Technol. & Eng., Univ. of Ottawa, Ont., Canada
Abstract :
In this paper, we introduce an efficient hidden Markov model-based voice activity detection (VAD) algorithm with time-variant state transition probabilities in the underlying Markov chain. The transition probabilities vary in an exponential charge/discharge scheme and are softly merged with state conditional likelihood into a final VAD decision. Working in the domain of ITU-T G.729 parameters, with no additional cost for feature extraction, the proposed algorithm significantly outperforms G.729 Annex B VAD while providing a balanced tradeoff between clipping and false detection errors. The performance compares very favorably with adaptive multirate VAD, phase 2 (AMR2).
Keywords :
feature extraction; hidden Markov models; speech processing; HMM-based voice activity detection; ITU-T G.729; Markov chain; VAD decision; adaptive multirate VAD; clipping; exponential charge/discharge scheme; false detection errors; feature extraction; hidden Markov model; semi-continuous state transition probability; state conditional likelihood; time-variant state transition probabilities; Background noise; Communication standards; Costs; Echo cancellers; Feature extraction; Hidden Markov models; Information technology; Noise cancellation; Noise generators; Speech enhancement;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1327237