DocumentCode
180121
Title
A posteriori voiced/unvoiced probability estimation based on a sinusoidal model
Author
Rehr, Robert ; Krawczyk, Michal ; Gerkmann, Timo
Author_Institution
Dept. of Med. Phys. & Acoust., Univ. of Oldenburg, Oldenburg, Germany
fYear
2014
fDate
4-9 May 2014
Firstpage
6944
Lastpage
6948
Abstract
In this paper, we focus on methods for estimating the a posteriori probability of a signal segment being voiced which employ a harmonic signal model. Fisher et al. [1] present two likelihood functions for voiced and unvoiced speech from which the posterior probability can be derived. However, due to the chosen models, the a posteriori probability of a signal segment being voiced does not go to 0 % in unvoiced speech. Thus, a novel algorithm is proposed, which incorporates the expected unvoiced speech energy and allows for obtaining low probabilities. Further, it explicitly models the statistics of the segment energy and employs a state-of-the-art noise tracker. Experiments which were conducted on the TIMIT database for different noise types and noise levels show that the proposed method results in lower over-estimation and under-estimation of the voicing probability as compared to [1].
Keywords
acoustic noise; estimation theory; harmonic analysis; probability; speech processing; statistics; TIMIT database; a posteriori voiced-unvoiced probability estimation; harmonic signal model; likelihood functions; noise levels; noise tracker; noise types; segment energy; signal segment; sinusoidal model; statistics; unvoiced speech energy; voicing probability; Harmonic analysis; Noise measurement; Shape; Signal to noise ratio; Speech; Speech processing; a posteriori probability; harmonic model; likelihood ratio test; voiced-unvoiced decision; voicing determination;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854946
Filename
6854946
Link To Document