Title :
Voicing detection in DAP-STC
Author :
Ho, M.S. ; Molyneux, D.J. ; Cheetham, B.M.G.
Author_Institution :
Dept. of Comput. Sci., Manchester Univ., UK
Abstract :
Sinusoidal transform coding (STC) requires an all-pole representation of spectra derived periodically from the short-term speech spectral envelope and a “voicing probability” frequency fv to divide each spectrum into two sub-bands: voiced below fv and unvoiced above fv. Discrete all-pole (DAP) modeling may be applied to STC to improve the accuracy of the short-term spectral envelope for voiced speech with modifications to accommodate unvoiced speech and spectra which do not conform well to an all-pole model. This paper presents a novel approach to the determination of fv which is appropriate when DAP is employed. It is a frequency-domain algorithm with an analysis-by-synthesis optimisation process. This approach improves the accuracy of DAP-STC modeled speech
Keywords :
frequency-domain analysis; optimisation; poles and zeros; probability; signal representation; spectral analysis; speech coding; speech synthesis; transform coding; DAP-STC modeled speech; all-pole spectra representation; analysis-by-synthesis optimisation; discrete all-pole modeling; frequency-domain algorithm; short-term speech spectral envelope; sinusoidal transform coding; speech coding; unvoiced speech band; voiced speech band; voicing detection; voicing probability frequency; Algorithm design and analysis; Computer science; Digital audio players; Frequency domain analysis; Frequency estimation; Power harmonic filters; Resonance; Speech analysis; Speech coding; Transform coding;
Conference_Titel :
Speech Coding, 2000. Proceedings. 2000 IEEE Workshop on
Conference_Location :
Delavan, WI
Print_ISBN :
0-7803-6416-3
DOI :
10.1109/SCFT.2000.878386