Title :
Bandwidth Expansionwith a p??lya URN Model
Author :
Raj, Bhiksha ; Singh, Rajdeep ; Shashanka, Madhusudana ; Smaragdis, Paris
Author_Institution :
Mitsubishi Electr. Res. Labs, Cambridge, MA, USA
Abstract :
We present a new statistical technique for the estimation of the high frequency components (4-8 kHz) of speech signals from narrow-band (0-4 kHz) signals. The magnitude spectra of broadband speech are modelled as the outcome of a Polya Urn process, that represents the spectra as the histogram of the outcome of several draws from a mixture multinomial distribution over frequency indices. The multinomial distributions that compose this process are learnt from a corpus of broadband (0-8 kHz) speech. To estimate high-frequency components of narrow-band speech, its spectra are also modelled as the outcome of draws from a mixture-multinomial process that is composed of the learnt multinomials, where the counts of the indices of higher frequencies have been obscured. The obscured high-frequency components are then estimated as the expected number of draws of their indices from the mixture-multinomial. Experiments conducted on bandlimited signals derived from the WSJ corpus show that the proposed procedure is able to accurately estimate the high frequency components of these signals.
Keywords :
speech processing; statistical analysis; statistical distributions; 0 to 4 kHz; 4 to 8 kHz; Polya Urn model; bandlimited signals; bandwidth expansion; broadband speech signals; mixture multinomial distribution; narrow-band speech signals; statistical technique; Bandwidth; Frequency estimation; Hidden Markov models; Histograms; Narrowband; Random variables; Signal reconstruction; Signal restoration; Speech processing; US Department of Transportation; Signal reconstruction; Signal restoration; Speech enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366983