DocumentCode :
312013
Title :
An application of recurrent neural networks to low bit rate speech coding
Author :
Kohata, Minoru
Author_Institution :
Fac. of Eng., Tohoku Univ., Sendai, Japan
Volume :
1
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
314
Abstract :
It is well known that the LSP coefficient which represents the speech spectrum envelope as one of the linear prediction coefficients, shows good performance for spectral interpolation along the time axis, but it is also known that the duration of interpolation is limited up to 20~30 ms. This limitation makes it difficult to reduce the bit rate in very low bit rate speech coding. To resolve this problem, recurrent neural networks (RNN) were applied to interpolate LSP coefficients, and it was possible to increase the duration of interpolation to about 100 ms without so much degradation of the synthesized speech quality
Keywords :
interpolation; linear predictive coding; recurrent neural nets; speech coding; LSP coefficient; linear prediction coefficients; low bit rate speech coding; recurrent neural networks; spectral interpolation; speech spectrum envelope; synthesized speech quality; Bit rate; Degradation; Delay effects; Ear; Interpolation; Network synthesis; Neural networks; Recurrent neural networks; Speech coding; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607116
Filename :
607116
Link To Document :
بازگشت