DocumentCode :
110569
Title :
Demodulation of Narrowband Speech Spectrograms Using the Riesz Transform
Author :
Aragonda, Haricharan ; Seelamantula, Chandra Sekhar
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
Volume :
23
Issue :
11
fYear :
2015
fDate :
Nov. 2015
Firstpage :
1824
Lastpage :
1834
Abstract :
We propose a two-dimensional (2-D) multicomponent amplitude-modulation, frequency-modulation (AM-FM) model for a spectrogram patch corresponding to voiced speech, and develop a new demodulation algorithm to effectively separate the AM, which is related to the vocal tract response, and the carrier, which is related to the excitation. The demodulation algorithm is based on the Riesz transform and is developed along the lines of Hilbert-transform-based demodulation for 1-D AM-FM signals. We compare the performance of the Riesz transform technique with that of the sinusoidal demodulation technique on real speech data. Experimental results show that the Riesz-transform-based demodulation technique represents spectrogram patches accurately. The spectrograms reconstructed from the demodulated AM and carrier are inverted and the corresponding speech signal is synthesized. The signal-to-noise ratio (SNR) of the reconstructed speech signal, with respect to clean speech, was found to be 2 to 4 dB higher in case of the Riesz transform technique than the sinusoidal demodulation technique.
Keywords :
Hilbert transforms; amplitude modulation; demodulation; frequency modulation; signal reconstruction; signal synthesis; speech synthesis; 1D AM-FM signal; 2D multicomponent amplitude modulation; Hilbert transform-based demodulation; Riesz transform; frequency modulation model; narrowband speech spectrogram demodulation; signal-to-noise ratio; sinusoidal demodulation technique; spectrogram patch reconstruction; speech signal reconstruction; speech signal synthesis; vocal tract response; voiced speech; Demodulation; Narrowband; Spectrogram; Speech; Speech processing; Time-frequency analysis; Transforms; Amplitude modulation model of spectrograms; Riesz transform; grating compression transform (GCT); multiband AM-FM; sinusoidal demodulation; spectro-temporal analysis;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2449088
Filename :
7131474
Link To Document :
بازگشت