Title :
Identification and reconstruction of the unvoiced component in speech
Author :
Kleijn, W. Bastiaan ; Jefremov, Andrea ; Murthi, Manohar N.
Author_Institution :
Dept. of Speech, Music & Hearing, R. Inst. of Technol., Stockholm, Sweden
Date :
Oct. 29 2000-Nov. 1 2000
Abstract :
Speech is well described by a source-filter model. The source properties are critical for good-quality reconstructed speech. We describe a source model that facilitates both low-rate coding and signal modification. The source signal is described by means of pitch-synchronous frame expansions, with different subsets of the coefficients corresponding to so-called voiced and unvoiced components. To obtain a perceptually plausible voiced-unvoiced decomposition even at speech onsets, our frame functions adapt to the signal. The unvoiced component is generated by replacing the corresponding coefficients with realizations of a random variable with similar statistics. Existing sinusoidal and waveform-interpolation excitation models form approximations to the presented procedure.
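The coefficient-replacement step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the distribution or which statistics are matched, so the sketch assumes Gaussian draws whose mean and standard deviation match each unvoiced coefficient track across frames. The function name `regenerate_unvoiced`, the list-of-frames layout, and the index list `unvoiced_idx` are hypothetical, and the pitch-synchronous analysis and signal-adaptive frame functions are assumed to have been applied already.

```python
import random
import statistics


def regenerate_unvoiced(coeffs, unvoiced_idx, rng=None):
    """Replace the 'unvoiced' expansion coefficients with random draws.

    coeffs       : list of frames, each a list of expansion coefficients
                   (assumed to come from a pitch-synchronous frame expansion).
    unvoiced_idx : indices of the coefficients treated as unvoiced
                   (hypothetical; the paper's actual subset rule is not given here).
    rng          : optional random.Random instance for reproducible draws.

    Assumption: each unvoiced coefficient track is modelled as Gaussian with
    the mean and (population) standard deviation measured across frames.
    """
    if rng is None:
        rng = random.Random()

    # Copy the frames so the caller's data is left untouched.
    out = [list(frame) for frame in coeffs]

    for k in unvoiced_idx:
        track = [frame[k] for frame in coeffs]
        mu = statistics.fmean(track)
        sigma = statistics.pstdev(track)
        # Voiced coefficients are kept; only index k is redrawn in every frame.
        for frame in out:
            frame[k] = rng.gauss(mu, sigma)

    return out
```

Calling `regenerate_unvoiced(frames, [1, 2], random.Random(0))` on a list of three-coefficient frames would leave coefficient 0 of every frame unchanged and redraw coefficients 1 and 2. Matching only first- and second-order statistics is a simplification chosen for brevity.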
Keywords :
channel bank filters; identification; source coding; speech coding; frame functions; good quality reconstructed speech; low-rate coding; perceptually plausible voiced-unvoiced decomposition; pitch-synchronous frame expansions; random variable; reconstruction; signal modification; sinusoidal excitation models; source-filter model; speech onsets; unvoiced component; voiced component; waveform-interpolation excitation models; Acoustic noise; Auditory system; Gaussian noise; Lungs; Predictive models; Random variables; Resonance; Speech enhancement; Speech synthesis; Statistics
Conference_Title :
Signals, Systems and Computers, 2000. Conference Record of the Thirty-Fourth Asilomar Conference on
Conference_Location :
Pacific Grove, CA, USA
Print_ISBN :
0-7803-6514-3
DOI :
10.1109/ACSSC.2000.911232