DocumentCode
1342427
Title
A multiband excited waveform-interpolated 2.35-kbps speech codec for bandlimited channels
Author
Brooks, F.C.A. ; Hanzo, Lajos
Author_Institution
Dept. of Electron. & Comput. Sci., Southampton Univ., UK
Volume
49
Issue
3
fYear
2000
fDate
5/1/2000 12:00:00 AM
Firstpage
766
Lastpage
777
Abstract
Following a brief portrayal of the activities in 2.4-kbps speech coding, a wavelet-based pitch detector is invoked, which reduces the complexity of conventional autocorrelation-based pitch detectors, while ensuring smooth pitch trajectory evolution. This scheme is incorporated in a waveform-interpolated codec, which uses voiced-unvoiced (V/U) classification, and instead of simple Dirac pulses, an unconventional zinc basis function excitation is employed for modeling the voiced excitation. The required zinc-function parameters are determined in an analysis-by-synthesis loop, and for the sake of smooth waveform evolution and reduced complexity, a focused search strategy and a few further suboptimum restrictions are imposed without seriously affecting the speech quality. This baseline codec operates at a rate of 1.9 kbps, but it suffers from slight buzziness during the periods of excessive voicing. This impediment is then mitigated by invoking a mixed V/U multiband excitation, which slightly increases the bit rate to 2.35 kbps due to the transmission of the 3-b voicing strength code in each of the three excitation bands
Keywords
bandlimited communication; computational complexity; interpolation; parameter estimation; signal classification; signal detection; speech codecs; speech coding; telecommunication channels; wavelet transforms; 1.9 kbit/s; 2.35 kbit/s; 2.4 kbit/s; Dirac pulses; analysis-by-synthesis loop; autocorrelation-based pitch detectors; bandlimited channels; excitation bands; focused search strategy; mixed V/U multiband excitation; multiband excited waveform-interpolated speech codec; pitch estimation; reduced complexity pitch detector; smooth pitch trajectory evolution; speech coding; speech quality; voiced excitation; voiced-unvoiced classification; voicing strength code; wavelet-based pitch detector; zinc basis function excitation; zinc-function parameters; Bit rate; Detectors; Interpolation; Prototypes; Speech analysis; Speech codecs; Speech coding; Speech synthesis; Standardization; Zinc;
fLanguage
English
Journal_Title
Vehicular Technology, IEEE Transactions on
Publisher
ieee
ISSN
0018-9545
Type
jour
DOI
10.1109/25.845096
Filename
845096
Link To Document