Title :
Application of speaker modification techniques to phonetic vocoding
Author :
Ribeiro, Carlos M. ; Trancoso, Isabel M.
Author_Institution :
INESC, Lisbon, Portugal
Abstract :
The goal of the work described in the paper is to develop a very low bit rate vocoding scheme. The vocoder is a typical LPC vocoder whose parameters are post-processed on a phone-by-phone basis, resulting in a variable bit rate segment vocoder. Given the well-known speaker recognizability problems presented by vocoders at such low bit rates, the authors have attempted to integrate a speaker modification method based on altering the formant frequencies and bandwidths of vowel segments. This is done by transmitting, for each phone, the mean value and standard deviation of the radius and angle of the poles corresponding to formant frequencies. In the decoder stage, the phone index is used to retrieve a set of normalized values from a codebook of 'typical' phones. This set is speaker-adapted to preserve the static characteristics (average and standard deviation), but relies on the typical phone to represent the dynamic characteristics, such as formant trajectories.
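The decoder-side adaptation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the codebook values are normalized, so zero-mean, unit-variance scaling is assumed here. The function names, the 8 kHz sampling rate, the 80 Hz bandwidth, and the formant tracks are all hypothetical. The only standard relation used is the mapping from a formant with centre frequency F and bandwidth B to a pole with radius exp(-pi*B/fs) and angle 2*pi*F/fs.

```python
import math
import statistics


def formant_to_pole(freq_hz, bw_hz, fs_hz):
    """Map a formant (centre frequency, bandwidth) to pole radius and angle."""
    radius = math.exp(-math.pi * bw_hz / fs_hz)
    angle = 2.0 * math.pi * freq_hz / fs_hz
    return radius, angle


def normalize(track):
    """Return (normalized track, mean, std) for one pole parameter over a phone.

    Assumption: normalization is z-scoring with the population std.
    """
    mean = statistics.mean(track)
    std = statistics.pstdev(track)
    if std == 0.0:
        return [0.0] * len(track), mean, std
    return [(x - mean) / std for x in track], mean, std


def adapt(normalized_track, mean, std):
    """Rescale a normalized codebook trajectory to the transmitted statistics."""
    return [mean + std * z for z in normalized_track]


fs = 8000.0  # hypothetical sampling rate (Hz)

# Hypothetical first-formant tracks (Hz) over one vowel segment.
typical_f1 = [650.0, 700.0, 740.0, 720.0]  # codebook 'typical' phone
speaker_f1 = [540.0, 580.0, 600.0, 590.0]  # speaker being encoded

typical_angle = [formant_to_pole(f, 80.0, fs)[1] for f in typical_f1]
speaker_angle = [formant_to_pole(f, 80.0, fs)[1] for f in speaker_f1]

# Codebook side: store only the normalized shape of the typical trajectory.
codebook_entry, _, _ = normalize(typical_angle)

# Encoder side: only the mean and std of the speaker's pole angle are sent.
_, spk_mean, spk_std = normalize(speaker_angle)

# Decoder side: typical trajectory shape, speaker's static statistics.
decoded_angle = adapt(codebook_entry, spk_mean, spk_std)
```

Under these assumptions the decoded trajectory keeps the shape of the typical phone while its mean and standard deviation match the transmitted speaker statistics. The same rescaling would apply to the pole radius.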
Keywords :
decoding; linear predictive coding; speech coding; variable rate codes; vocoders; LPC vocoder; codebook; decoder stage; dynamic characteristics; formant bandwidth alteration; formant frequency alteration; normalized value retrieval; phone index; phone-by-phone parameter post-processing; phonetic vocoding; pole angle; pole radius; speaker modification techniques; speaker recognizability problems; static characteristics; variable bit rate segment vocoder; very low bit rate vocoding scheme; vowel segments; Bandwidth; Bit rate; Frequency; Hidden Markov models; Linear predictive coding; Loudspeakers; Polynomials; Speech recognition; Statistics; Vocoders;
Conference_Title :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607114