DocumentCode
1088492
Title
Quantization and bit allocation in speech processing
Author
Gray, Augustine H., Jr. ; Markel, John D.
Author_Institution
University of California, Santa Barbara, CA
Volume
24
Issue
6
fYear
1976
fDate
12/1/1976 12:00:00 AM
Firstpage
459
Lastpage
473
Abstract
The topic of quantization and bit allocation in speech processing is studied using an L2 norm. Closed-form expressions are derived for the root mean square (rms) spectral deviation due to variations in one, two, or multiple parameters. For one-parameter variation, the reflection coefficients, log area ratios, and inverse sine coefficients are studied. It is shown that, depending upon the criterion chosen, either log area ratios or inverse sine quantization can be viewed as optimal. From a practical point of view, it is shown experimentally that very little difference exists among the various quantization methods beyond the second coefficient. Two-parameter variations are studied in terms of formant frequency and bandwidth movement and in terms of a two-pair quantization scheme. A lower bound on the number of quantization levels required to satisfy a given maximum spectral deviation is derived along with the two-pair quantization scheme which approximately satisfies the bound. It is shown theoretically that the two-pair quantization scheme has a 10-bit superiority over other above-mentioned quantization schemes in the sense of theoretically assuring that a maximum overall log spectral deviation will not be exceeded.
Keywords
Acoustic reflection; Bit rate; Closed-form solution; Dictionaries; Quantization; Speech processing; Speech synthesis; Springs; Stress; Testing;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1976.1162857
Filename
1162857
Link To Document