• DocumentCode
    1342427
  • Title
    A multiband excited waveform-interpolated 2.35-kbps speech codec for bandlimited channels
  • Author
    Brooks, F.C.A.; Hanzo, Lajos
  • Author_Institution
    Dept. of Electron. & Comput. Sci., Southampton Univ., UK
  • Volume
    49
  • Issue
    3
  • fYear
    2000
  • fDate
    5/1/2000 12:00:00 AM
  • Firstpage
    766
  • Lastpage
    777
  • Abstract
    Following a brief portrayal of the activities in 2.4-kbps speech coding, a wavelet-based pitch detector is invoked, which reduces the complexity of conventional autocorrelation-based pitch detectors, while ensuring smooth pitch trajectory evolution. This scheme is incorporated in a waveform-interpolated codec, which uses voiced-unvoiced (V/U) classification, and instead of simple Dirac pulses, an unconventional zinc basis function excitation is employed for modeling the voiced excitation. The required zinc-function parameters are determined in an analysis-by-synthesis loop, and for the sake of smooth waveform evolution and reduced complexity, a focused search strategy and a few further suboptimum restrictions are imposed without seriously affecting the speech quality. This baseline codec operates at a rate of 1.9 kbps, but it suffers from slight buzziness during the periods of excessive voicing. This impediment is then mitigated by invoking a mixed V/U multiband excitation, which slightly increases the bit rate to 2.35 kbps due to the transmission of the 3-bit voicing strength code in each of the three excitation bands.
  • Keywords
    bandlimited communication; computational complexity; interpolation; parameter estimation; signal classification; signal detection; speech codecs; speech coding; telecommunication channels; wavelet transforms; 1.9 kbit/s; 2.35 kbit/s; 2.4 kbit/s; Dirac pulses; analysis-by-synthesis loop; autocorrelation-based pitch detectors; bandlimited channels; excitation bands; focused search strategy; mixed V/U multiband excitation; multiband excited waveform-interpolated speech codec; pitch estimation; reduced complexity pitch detector; smooth pitch trajectory evolution; speech quality; voiced excitation; voiced-unvoiced classification; voicing strength code; wavelet-based pitch detector; zinc basis function excitation; zinc-function parameters; Bit rate; Detectors; Interpolation; Prototypes; Speech analysis; Speech codecs; Speech coding; Speech synthesis; Standardization; Zinc
  • fLanguage
    English
  • Journal_Title
    Vehicular Technology, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9545
  • Type
    jour
  • DOI
    10.1109/25.845096
  • Filename
    845096