• DocumentCode
    178611
  • Title

    A sparse smoothing approach for Gaussian Mixture Model based Acoustic-to-Articulatory Inversion

  • Author

    Sudhakar, P. ; Jacques, Laurent ; Ghosh, P.K.

  • Author_Institution
    ICTEAM/ELEN, Univ. Catholique de Louvain, Louvain, Belgium
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    3032
  • Lastpage
    3036
  • Abstract
    It is well-known that the performance of the Gaussian Mixture Model (GMM) based Acoustic-to-Articulatory Inversion (AAI) improves by either incorporating smoothness constraint directly in the inversion criterion or smoothing (low-pass filtering) estimated articulator trajectories in a post-processing step, where smoothing is performed independently of the inversion. As the low-pass filtering is independent of inversion, the smoothed articulator trajectory samples no longer remain optimal as per the inversion criterion. In this work, we propose a sparse smoothing technique which constrains the smoothed articulator trajectory to be different from the estimated trajectory only at a sparse subset of samples while simultaneously achieving the required degree of smoothness. Inversion experiments on the articulatory database show that the sparse smoothing achieves an AAI performance similar to that using low-pass filtering but in sparse smoothing ~15% (on average) of the samples in the smoothed articulator trajectory remain identical to those in the estimated articulator trajectory thereby preserve their AAI optimality as opposed to 0% in low-pass filtering.
  • Keywords
    Gaussian processes; acoustic signal processing; low-pass filters; mixture models; smoothing methods; speech synthesis; AAI optimality; GMM based AAI; Gaussian mixture model based acoustic-to-articulatory inversion; articulator trajectory samples; low-pass filtering; sparse smoothing approach; sparse smoothing technique; Acoustics; Hidden Markov models; Optimization; Smoothing methods; Speech; Trajectory; Vectors; ℓ1 minimization; Gaussian mixture model; acoustic-to-articulatory inversion; chambolle-pock; smoothing; sparsity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854157
  • Filename
    6854157