• DocumentCode
    730663
  • Title

    Evaluation of speech inverse filtering techniques using a physiologically based synthesizer

  • Author

    Gudnason, Jon ; Mehta, Daryush D. ; Quatieri, Thomas F.

  • Author_Institution
    Center for Anal. & Design of Intell. Agents, Reykjavik Univ., Menntavegur, Iceland
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4245
  • Lastpage
    4249
  • Abstract
    Glottal inverse filtering methods are designed to derive a glottal flow waveform from a speech signal. In this paper, we evaluate and compare such methods using a speech synthesizer that simulates voice production in a physiologically-based manner that includes complexities such as nonlinear source-tract coupling. Five inverse filtering techniques are evaluated on 90 synthesized speech waveforms generated by setting six vowel configurations, three glottal models, and five fundamental frequencies. Using normalized mean square error as the primary performance metric of the estimated glottal flow derivative, results show that the accuracy of all methods depends on the configuration of the vocal tract, glottis and the fundamental frequency. Averaged over these conditions, the closed phase covariance and one weighted covariance algorithm yield lower error rates (0.41 ± 0.2) than iterative and adaptive inverse filtering (0.49 ± 0.1) and complex cepstrum decomposition (0.76 ± 0.1).
  • Keywords
    acoustic signal processing; covariance analysis; iterative methods; speech processing; adaptive inverse filtering; closed phase covariance algorithm; complex cepstrum decomposition; glottal flow derivative; glottal flow waveform; glottal inverse filtering methods; iterative inverse filtering; nonlinear source-tract coupling; normalized mean square error; physiologically based synthesizer; speech inverse filtering; speech signal; vocal tract; weighted covariance algorithm; Computational modeling; Estimation; Filtering; Speech; Speech processing; Synthesizers; Glottal inverse filtering; acoustics; glottal closure instant detection; glottal flow; speech signal processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178771
  • Filename
    7178771