DocumentCode :
336752
Title :
Wideband speech coding with toll quality based on IA-model
Author :
Ng, Ling Kok ; Li, Gang ; Lin, Xiao ; Bi, Guoan
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
Volume :
1
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
185
Abstract :
We propose an instantaneous amplitude (IA) based model for speech signal representation. This can avoid the difficulty in dealing with the time-varying phases and allows us to perform an optimization procedure easily such that the synthetic signal can be made as close to the original one as possible. A simplified frequency picking algorithm is derived to shorten the processing time while still maintaining the quality of the synthetic speech. Experiments show that the synthetic speech with the developed technique is of toll quality and almost perceptually indistinguishable from the original speech. Initial work on the coding of the parameters, for a 16 kHz sampled speech, for the IA model is done and a toll quality synthesized speech at a bit rate of 40 kbps is achieved
Keywords :
optimisation; signal representation; signal sampling; speech coding; speech intelligibility; speech synthesis; 16 kHz; 40 kbit/s; IA-model; bit rate; experiments; frequency picking algorithm; instantaneous amplitude; optimization procedure; processing time; sampled speech; speech signal representation; synthetic signal; synthetic speech quality; toll quality; wideband speech coding; Frequency estimation; Polynomials; Signal processing; Signal synthesis; Speech analysis; Speech coding; Speech processing; Speech synthesis; Vocoders; Wideband;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.758093
Filename :
758093
Link To Document :
بازگشت