Title :
Wideband speech coding with toll quality based on IA-model
Author :
Ng, Ling Kok ; Li, Gang ; Lin, Xiao ; Bi, Guoan
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
Abstract :
We propose an instantaneous amplitude (IA) based model for speech signal representation. The model avoids the difficulty of dealing with time-varying phases and makes it easy to apply an optimization procedure that brings the synthetic signal as close as possible to the original. A simplified frequency picking algorithm is derived to shorten the processing time while maintaining the quality of the synthetic speech. Experiments show that speech synthesized with the developed technique is of toll quality and almost perceptually indistinguishable from the original speech. Initial work on coding the IA model parameters for speech sampled at 16 kHz has been carried out, and toll-quality synthesized speech is achieved at a bit rate of 40 kbps.
Keywords :
optimisation; signal representation; signal sampling; speech coding; speech intelligibility; speech synthesis; 16 kHz; 40 kbit/s; IA-model; bit rate; experiments; frequency picking algorithm; instantaneous amplitude; optimization procedure; processing time; sampled speech; speech signal representation; synthetic signal; synthetic speech quality; toll quality; wideband speech coding; Frequency estimation; Polynomials; Signal processing; Signal synthesis; Speech analysis; Speech coding; Speech processing; Speech synthesis; Vocoders; Wideband;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758093