Title :
Adaptive pitch period decimation and its application in speech compression
Author :
Logan, Joseph ; Gowdy, John
Author_Institution :
Speech Process. Group, Clemson Univ., SC, USA
Abstract :
This paper presents a new method of speech coding that takes advantage of the repetitiveness inherent in voiced speech. Voiced speech is broken into pitch period lengths (wavelets) and these signals are compared with one another to determine If two wavelets differ significantly. If the wavelets are significantly different, then they are encoded and transmitted; otherwise, the current wavelet Is not transmitted, and the next pitch period wavelet is compared. This results in encoding only a representative fraction of the speech signal and significantly lowers the number of bits required to transmit the signal. Pitch period determination is done by using the autocorrelation method and a median smoothing filter. The pitch period wavelets are preprocessed using a time weighted averaging method that allows concatenation of wavelets without sharp transitions at pitch boundaries, therefore reducing high frequency noise. Wavelets are compared using the Itakura distance measure, which is usually employed in speech recognition applications. The transmitted wavelets are encoded using a differential PCM method to further reduce the bit rate of the transmission. Unvoiced speech is encoded using an LPC method on a frame by frame basis. This results in high quality speech transmission at bit rates of approximately 3.8 kb/s
Keywords :
adaptive codes; channel capacity; correlation methods; data compression; differential pulse code modulation; linear predictive coding; median filters; smoothing methods; speech coding; transform coding; wavelet transforms; 3.8 kbit/s; Itakura distance measure; LPC method; adaptive pitch period decimation; autocorrelation method; bit rate; concatenation; differential PCM; high frequency noise; median smoothing filter; pitch period lengths; repetitiveness; speech coding; speech compression; speech signal; speech transmission; time weighted averaging method; transmitted wavelet; unvoiced speech; voiced speech; wavelets; Autocorrelation; Bit rate; Encoding; Filters; Frequency; Noise reduction; Phase change materials; Smoothing methods; Speech coding; Speech recognition;
Conference_Titel :
Southeastcon '96. Bringing Together Education, Science and Technology., Proceedings of the IEEE
Conference_Location :
Tampa, FL
Print_ISBN :
0-7803-3088-9
DOI :
10.1109/SECON.1996.510060