Title :
Mixed-phase AR models for voiced speech and perceptual cost functions
Author :
Gardner, William R. ; Rao, Bhaskar D.
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA, USA
Abstract :
Mixed-phase AR models are introduced for encoding the magnitudes and phases of the harmonics of voiced speech. Motivation for the use of the mixed-phase AR models is given and several cost functions are introduced, forming the basis for algorithms which estimate the model parameters. An efficient algorithm based on a quasi-linear least squares approach is presented, and a more sophisticated algorithm based on the perceptual masking properties of the ear is described. When the algorithms are used to model voiced speech signals using a 14th order mixed-phase model, high quality speech can be produced
Keywords :
autoregressive processes; ear; harmonics; least mean squares methods; parameter estimation; speech coding; speech intelligibility; algorithms; ear; harmonics; high quality speech; magnitudes; mixed-phase AR models; model parameters estimation; perceptual cost functions; perceptual masking properties; phases; quasi-linear least squares; speech coding; voiced speech signals; Acoustic pulses; Cost function; Finite impulse response filter; Frequency domain analysis; Integrated circuit modeling; Phase measurement; Power harmonic filters; Pulse shaping methods; Shape; Speech;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389319