• DocumentCode
    310623
  • Title

    A time varying ARMAX speech modeling with phase compensation using glottal source model

  • Author

    Funaki, Keiichi ; Miyanaga, Yoshikazu ; Tochinai, Koji

  • Author_Institution
    Graduate Sch. of Eng., Hokkaido Univ., Sapporo, Japan
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1299
  • Abstract
    This paper presents new speech analysis method based on a glottal-ARMAX (autoregressive and moving average exogenous) model with phase compensation. A glottal-ARMAX model consists of two kinds of inputs: glottal source model excitation and a white Gaussian input, and a vocal tract ARMAX model. The proposed method can simultaneously estimate the glottal source model and vocal tract ARMAX model parameters pitch synchronously. In this method, ARMAX identification using a modified MIS (model identification system) method is adopted to estimate the ARMAX parameters, and the hybrid approach of the genetic algorithm (GA) and simulated annealing (SA) is employed to efficiently solve the non-linear simultaneous optimization of both parameters. Furthermore, phase compensation using an all-pass filter is introduced within a generation loop in the GA method in order to compensate phase distortion. Experiments using synthetic speech and natural speech demonstrate the efficacy of the proposed method
  • Keywords
    Gaussian processes; all-pass filters; autoregressive moving average processes; filtering theory; genetic algorithms; parameter estimation; simulated annealing; speech processing; speech synthesis; time-varying systems; ARMAX identification; all-pass filter; autoregressive moving average exogenous model; efficiency; experiments; genetic algorithm; glottal ARMAX model; glottal source model; glottal source model excitation; modified model identification system; natural speech; nonlinear simultaneous optimization; phase compensation; pitch synchronous parameter estimation; simulated annealing; speech analysis method; synthetic speech; time varying ARMAX speech modeling; vocal tract ARMAX model; white Gaussian input; Filters; Gaussian processes; Natural languages; Optimization methods; Phase distortion; Phase estimation; Simulated annealing; Speech analysis; Speech coding; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596184
  • Filename
    596184