DocumentCode :
3360889
Title :
The two-dimensional discrete cosine transform applied to speech data
Author :
Baghai-Ravary, L. ; Beet, S.W. ; Tokhi, M.O.
Author_Institution :
Dept. of Electron. & Electr. Eng., Sheffield Univ., UK
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
244
Abstract :
A two-dimensional discrete cosine transform (2-D DCT), often used for image coding, has been applied to sequences of speech spectra produced by the maximum likelihood method (MLM). The coded data was compressed by nearly 90%, reducing it to a size smaller than that needed to store the coefficients of a 10th order linear predictive coding (LPC) model. The DCT-encoded data was then reconstructed and tested for intelligibility. It was found that the two-dimensional DCT method was significantly more intelligible and more natural-sounding than the LPC technique
Keywords :
discrete cosine transforms; maximum likelihood estimation; speech coding; speech intelligibility; transform coding; 2D DCT; DCT encoded data reconstruction; LPC; data compression; linear predictive coding; maximum likelihood method; speech coding; speech data; speech intelligibility; speech spectra sequences; two-dimensional discrete cosine transform; Discrete cosine transforms; Discrete transforms; Filters; Image coding; Image reconstruction; Linear predictive coding; Spectrogram; Speech coding; Technological innovation; Two dimensional displays;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.540403
Filename :
540403
Link To Document :
بازگشت