DocumentCode
310623
Title
A time varying ARMAX speech modeling with phase compensation using glottal source model
Author
Funaki, Keiichi ; Miyanaga, Yoshikazu ; Tochinai, Koji
Author_Institution
Graduate Sch. of Eng., Hokkaido Univ., Sapporo, Japan
Volume
2
fYear
1997
fDate
21-24 Apr 1997
Firstpage
1299
Abstract
This paper presents a new speech analysis method based on a glottal-ARMAX (autoregressive and moving average exogenous) model with phase compensation. A glottal-ARMAX model combines two kinds of inputs, a glottal source model excitation and a white Gaussian input, with a vocal tract ARMAX model. The proposed method can simultaneously estimate the glottal source model parameters and the vocal tract ARMAX model parameters pitch-synchronously. In this method, ARMAX identification using a modified MIS (model identification system) method is adopted to estimate the ARMAX parameters, and a hybrid of the genetic algorithm (GA) and simulated annealing (SA) is employed to efficiently solve the nonlinear simultaneous optimization of both parameter sets. Furthermore, phase compensation using an all-pass filter is introduced within the generation loop of the GA in order to compensate for phase distortion. Experiments using synthetic speech and natural speech demonstrate the efficacy of the proposed method.
Keywords
Gaussian processes; all-pass filters; autoregressive moving average processes; filtering theory; genetic algorithms; parameter estimation; simulated annealing; speech processing; speech synthesis; time-varying systems; ARMAX identification; all-pass filter; autoregressive moving average exogenous model; efficiency; experiments; genetic algorithm; glottal ARMAX model; glottal source model; glottal source model excitation; modified model identification system; natural speech; nonlinear simultaneous optimization; phase compensation; pitch synchronous parameter estimation; simulated annealing; speech analysis method; synthetic speech; time varying ARMAX speech modeling; vocal tract ARMAX model; white Gaussian input; Filters; Gaussian processes; Natural languages; Optimization methods; Phase distortion; Phase estimation; Simulated annealing; Speech analysis; Speech coding; Speech synthesis;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596184
Filename
596184