Title :
Demodulation of Narrowband Speech Spectrograms Using the Riesz Transform
Author :
Aragonda, Haricharan ; Seelamantula, Chandra Sekhar
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
Abstract :
We propose a two-dimensional (2-D) multicomponent amplitude-modulation, frequency-modulation (AM-FM) model for a spectrogram patch corresponding to voiced speech, and develop a new demodulation algorithm to effectively separate the AM, which is related to the vocal tract response, and the carrier, which is related to the excitation. The demodulation algorithm is based on the Riesz transform and is developed along the lines of Hilbert-transform-based demodulation for 1-D AM-FM signals. We compare the performance of the Riesz transform technique with that of the sinusoidal demodulation technique on real speech data. Experimental results show that the Riesz-transform-based demodulation technique represents spectrogram patches accurately. The spectrograms reconstructed from the demodulated AM and carrier are inverted and the corresponding speech signal is synthesized. The signal-to-noise ratio (SNR) of the reconstructed speech signal, with respect to clean speech, was found to be 2 to 4 dB higher in case of the Riesz transform technique than the sinusoidal demodulation technique.
Keywords :
Hilbert transforms; amplitude modulation; demodulation; frequency modulation; signal reconstruction; signal synthesis; speech synthesis; 1D AM-FM signal; 2D multicomponent amplitude modulation; Hilbert transform-based demodulation; Riesz transform; frequency modulation model; narrowband speech spectrogram demodulation; signal-to-noise ratio; sinusoidal demodulation technique; spectrogram patch reconstruction; speech signal reconstruction; speech signal synthesis; vocal tract response; voiced speech; Demodulation; Narrowband; Spectrogram; Speech; Speech processing; Time-frequency analysis; Transforms; Amplitude modulation model of spectrograms; Riesz transform; grating compression transform (GCT); multiband AM-FM; sinusoidal demodulation; spectro-temporal analysis;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2015.2449088