• DocumentCode
    19434
  • Title

    Speech Analysis and Synthesis with a Computationally Efficient Adaptive Harmonic Model

  • Author

    Morfi, Veronica ; Degottex, Gilles ; Mouchtaris, Athanasios

  • Author_Institution
    Comput. Sci. Dept., Univ. of Crete, Heraklion, Greece
  • Volume
    23
  • Issue
    11
  • fYear
    2015
  • fDate
    Nov. 2015
  • Firstpage
    1950
  • Lastpage
    1962
  • Abstract
    Harmonic models have to be both precise and fast in order to represent the speech signal adequately and be able to process large amount of data in a reasonable amount of time. For these purposes, the full-band adaptive harmonic model (aHM) used by the adaptive iterative refinement (AIR) algorithm has been proposed in order to accurately model the perceived characteristics of a speech signal. Even though aHM-AIR is precise, it lacks the computational efficiency that would make its use convenient for large databases. The least squares (LS) solution used in the original aHM-AIR accounts for most of the computational load. In a previous paper, we suggested a peak picking (PP) approach as a substitution to the LS solution. In order to integrate the adaptivity scheme of aHM in the PP approach, an adaptive discrete Fourier transform (aDFT), whose frequency basis can fully follow the variations of the f0 curve, was also proposed. In this paper, we complete the previous publication by evaluating the above methods for the whole analysis process of a speech signal. Evaluations have shown an average time reduction by four times using PP and aDFT compared to the LS solution. Additionally, based on formal listening tests, when using PP and aDFT, the quality of the re-synthesis is preserved compared to the original LS-based approach.
  • Keywords
    discrete Fourier transforms; iterative methods; regression analysis; signal representation; speech processing; speech synthesis; AIR algorithm; aDFT; aHM adaptivity scheme; adaptive discrete Fourier transform; adaptive iterative refinement algorithm; computational load; computationally efficient adaptive harmonic model; full-band adaptive harmonic model; least squares solution; speech analysis; speech signal representation; speech synthesis; Adaptation models; Atmospheric modeling; Computational modeling; Discrete Fourier transforms; Harmonic analysis; Speech; Time-frequency analysis; Fundamental frequency; harmonic model; peak picking (PP); speech analysis/synthesis; voice model;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2458580
  • Filename
    7163319