• DocumentCode
    3423006
  • Title

    Doppler-variant modeling of the vocal tract

  • Author

    Heim, Axel ; Sorger, Uli ; Hug, Florian

  • Author_Institution
    Inst. of Telecommun. & Appl. Inf. Theor., Ulm Univ., Ulm
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4197
  • Lastpage
    4200
  • Abstract
    A common technique to deploy linear prediction to non-stationary signals is time segmentation and local analysis. Variations of a process within such a segment cause inaccuracies. In this paper, we model the temporal changes of linear prediction coefficients (LPCs) as a Fourier series. We obtain a compact description of the vocal tract model limited by the predictor order and the maximum Doppler frequency. Filter stability is guaranteed by all-pass filtering, deploying the human ear´s insensitivity to absolute phase. The periodicity constraint induced by the Fourier series is counteracted by oversampling in the Doppler domain. With this approach, the number of coefficients required for the vocal tract modeling is significantly reduced compared to a LPC system with block-wise adaptation while exceeding its prediction gain. As a by-product it is found that the Doppler frequency of the vocal tract is in the order of 10 Hz. A generalization of the algorithm to an auto-regressive moving average model with time-correlated filter coefficients is straight forward.
  • Keywords
    filtering theory; prediction theory; speech coding; Doppler domain; Doppler frequency; Doppler-variant modeling; Fourier series; all-pass filtering; auto-regressive moving average model; block-wise adaptation; filter stability; linear prediction coefficient; non-stationary signal; prediction gain; predictor order; time segmentation; time-correlated filter coefficient; vocal tract modeling; Entropy; Filters; Fourier series; Frequency; Information theory; Linear predictive coding; Predictive models; Shape; Speech coding; Speech processing; Fourier series; Linear predictive coding; Speech coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518580
  • Filename
    4518580