DocumentCode :
542250
Title :
Gaussian mixture model based mutual information estimation between frequency bands in speech
Author :
Nilsson, Mattias ; Gustaftson, Harald ; Andersen, Soren Vang ; Kleijn, W. Bastiaan
Author_Institution :
Dept. of Speech, Music and Hearing, KTH (Royal Institute of Technology), SE- I00 44 Stockholm, Sweden
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
In this paper, we investigate the dependency between the spectral envelopes of speech in disjoint frequency bands, one covering the telephone bandwidth from 0.3 kHz to 3.4 kHz and one covering the frequencies from 3.7 kHz to 8 kHz. The spectral envelopes are jointly modeled with a Gaussian mixture model based on mel-frequency cepstral coefficients and the log-energy-ratio of the disjoint frequency bands. Using this model, we quantify the dependency between bands through their mutual information and the perceived entropy of the high frequency band. Our results indicate that the mutual information is only a small fraction of the perceived entropy of the high band. This suggests that speech bandwidth extension should not rely only on mutual information between narrow- and high-band spectra. Rather, such methods need to make use´ of perceptual properties to ensure that the extended signal sounds pleasant.
Keywords :
Acoustic distortion; Distortion measurement; Estimation; Frequency estimation; Narrowband; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743770
Filename :
5743770
Link To Document :
بازگشت