DocumentCode
336752
Title
Wideband speech coding with toll quality based on IA-model
Author
Ng, Ling Kok ; Li, Gang ; Lin, Xiao ; Bi, Guoan
Author_Institution
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
Volume
1
fYear
1999
fDate
15-19 Mar 1999
Firstpage
185
Abstract
We propose an instantaneous amplitude (IA) based model for speech signal representation. This can avoid the difficulty in dealing with the time-varying phases and allows us to perform an optimization procedure easily such that the synthetic signal can be made as close to the original one as possible. A simplified frequency picking algorithm is derived to shorten the processing time while still maintaining the quality of the synthetic speech. Experiments show that the synthetic speech with the developed technique is of toll quality and almost perceptually indistinguishable from the original speech. Initial work on the coding of the parameters, for a 16 kHz sampled speech, for the IA model is done and a toll quality synthesized speech at a bit rate of 40 kbps is achieved
Keywords
optimisation; signal representation; signal sampling; speech coding; speech intelligibility; speech synthesis; 16 kHz; 40 kbit/s; IA-model; bit rate; experiments; frequency picking algorithm; instantaneous amplitude; optimization procedure; processing time; sampled speech; speech signal representation; synthetic signal; synthetic speech quality; toll quality; wideband speech coding; Frequency estimation; Polynomials; Signal processing; Signal synthesis; Speech analysis; Speech coding; Speech processing; Speech synthesis; Vocoders; Wideband;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.758093
Filename
758093
Link To Document