DocumentCode
3054794
Title
A variable-order Markov chain for coding of speech spectra
Author
Roucos, S. ; Makhoul, J. ; Schwartz, H.
Author_Institution
Bolt Beranek and Newman Inc., Cambridge, MA
Volume
7
fYear
1982
fDate
30072
Firstpage
582
Lastpage
585
Abstract
We present a method that reduces the bit rate of a low rate LPC vocoder by modelling the sequence of quantized spectra by a Markov chain. To minimize the bit rate, one would want to use a high-order chain. Unfortunately, a high-order chain would require an inordinate amount of data for training. We describe in this paper the use of a variable order Markov chain that maximizes the effective use of a given amount of speech data. To reduce the number of states of a high-order chain, we define an equivalence relation on the states, i.e., "similar" states are grouped together in an equivalence class and a single conditional distribution is associated with the equivalence class. We introduce two equivalence relations. In the first, called variable order Markov chain, the equivalence classes represent the most probable states of any order. In the second method, called variable resolution, the equivalence class is obtained by decreasing the quantization accuracy in representing a spectrum that belongs to a more remote past. For an LPC vocoder with 64 possible spectra (using 6-bit vector quantization), the second method is superior to the first and decreases the entropy from 6 bits to 4 bits per spectrum with 256 equivalence classes.
Keywords
Bit rate; Encoding; Entropy; Fasteners; Linear predictive coding; Speech coding; Timing; Training data; Vector quantization; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type
conf
DOI
10.1109/ICASSP.1982.1171652
Filename
1171652
Link To Document