• DocumentCode
    312203
  • Title

    A novel approach to the estimation of voice source and vocal tract parameters from speech signals

  • Author

    Ding, Wen ; Kasuya, Hideki

  • Author_Institution
    Fac. of Eng., Utsunomiya Univ., Japan
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1257
  • Abstract
    The paper presents a novel adaptive pitch-synchronous analysis method for simultaneous estimation of voice source and vocal tract (formant/antiformant) parameters from the speech signal. The method uses a parametric Rosenberg-Klatt model to generate a glottal waveform and an autoregressive with exogenous input (ARX) model for representing the speech production process. The time-varying coefficients of the model are estimated with an adaptive algorithm based on a Kalman filter, while the parameters of the Rosenberg-Klatt model are optimized using the simulated annealing method. In addition, a new hybrid error criterion is used to optimize the glottal opening instant. Furthermore, in order to estimate the fundamental period parameter T0, it is defined as two successive glottal closure instants, and is estimated automatically based on the obtained differentiated glottal waveform. Experiments using two-channel speech signals (speech and electroglottograph (EGG) signal) and continuous speech show a good estimation performance
  • Keywords
    Kalman filters; adaptive signal processing; autoregressive processes; bioelectric potentials; parameter estimation; simulated annealing; speech processing; Kalman filter; adaptive algorithm; adaptive pitch-synchronous analysis method; autoregressive with exogenous input model; continuous speech; differentiated glottal waveform; estimation performance; fundamental period parameter; glottal waveform generation; hybrid error criterion; optimized glottal opening instant; parametric Rosenberg-Klatt model; simulated annealing method; speech production process representation; speech signals; successive glottal closure instants; time-varying coefficients; two-channel speech signals; vocal tract parameter estimation; voice source parameter estimation; Adaptive algorithm; Differential equations; Electronic mail; Optimization methods; Signal analysis; Signal processing; Simulated annealing; Speech analysis; Speech coding; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607837
  • Filename
    607837