• DocumentCode
    2202658
  • Title

    Adaptive pitch period decimation and its application in speech compression

  • Author

    Logan, Joseph ; Gowdy, John

  • Author_Institution
    Speech Process. Group, Clemson Univ., SC, USA
  • fYear
    1996
  • fDate
    11-14 Apr 1996
  • Firstpage
    220
  • Lastpage
    222
  • Abstract
    This paper presents a new method of speech coding that takes advantage of the repetitiveness inherent in voiced speech. Voiced speech is broken into pitch period lengths (wavelets) and these signals are compared with one another to determine If two wavelets differ significantly. If the wavelets are significantly different, then they are encoded and transmitted; otherwise, the current wavelet Is not transmitted, and the next pitch period wavelet is compared. This results in encoding only a representative fraction of the speech signal and significantly lowers the number of bits required to transmit the signal. Pitch period determination is done by using the autocorrelation method and a median smoothing filter. The pitch period wavelets are preprocessed using a time weighted averaging method that allows concatenation of wavelets without sharp transitions at pitch boundaries, therefore reducing high frequency noise. Wavelets are compared using the Itakura distance measure, which is usually employed in speech recognition applications. The transmitted wavelets are encoded using a differential PCM method to further reduce the bit rate of the transmission. Unvoiced speech is encoded using an LPC method on a frame by frame basis. This results in high quality speech transmission at bit rates of approximately 3.8 kb/s
  • Keywords
    adaptive codes; channel capacity; correlation methods; data compression; differential pulse code modulation; linear predictive coding; median filters; smoothing methods; speech coding; transform coding; wavelet transforms; 3.8 kbit/s; Itakura distance measure; LPC method; adaptive pitch period decimation; autocorrelation method; bit rate; concatenation; differential PCM; high frequency noise; median smoothing filter; pitch period lengths; repetitiveness; speech coding; speech compression; speech signal; speech transmission; time weighted averaging method; transmitted wavelet; unvoiced speech; voiced speech; wavelets; Autocorrelation; Bit rate; Encoding; Filters; Frequency; Noise reduction; Phase change materials; Smoothing methods; Speech coding; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Southeastcon '96. Bringing Together Education, Science and Technology., Proceedings of the IEEE
  • Conference_Location
    Tampa, FL
  • Print_ISBN
    0-7803-3088-9
  • Type

    conf

  • DOI
    10.1109/SECON.1996.510060
  • Filename
    510060