Title :
Speech Analysis and Synthesis with a Computationally Efficient Adaptive Harmonic Model
Author :
Morfi, Veronica ; Degottex, Gilles ; Mouchtaris, Athanasios
Author_Institution :
Comput. Sci. Dept., Univ. of Crete, Heraklion, Greece
Abstract :
Harmonic models have to be both precise and fast in order to represent the speech signal adequately and be able to process large amount of data in a reasonable amount of time. For these purposes, the full-band adaptive harmonic model (aHM) used by the adaptive iterative refinement (AIR) algorithm has been proposed in order to accurately model the perceived characteristics of a speech signal. Even though aHM-AIR is precise, it lacks the computational efficiency that would make its use convenient for large databases. The least squares (LS) solution used in the original aHM-AIR accounts for most of the computational load. In a previous paper, we suggested a peak picking (PP) approach as a substitution to the LS solution. In order to integrate the adaptivity scheme of aHM in the PP approach, an adaptive discrete Fourier transform (aDFT), whose frequency basis can fully follow the variations of the f0 curve, was also proposed. In this paper, we complete the previous publication by evaluating the above methods for the whole analysis process of a speech signal. Evaluations have shown an average time reduction by four times using PP and aDFT compared to the LS solution. Additionally, based on formal listening tests, when using PP and aDFT, the quality of the re-synthesis is preserved compared to the original LS-based approach.
Keywords :
discrete Fourier transforms; iterative methods; regression analysis; signal representation; speech processing; speech synthesis; AIR algorithm; aDFT; aHM adaptivity scheme; adaptive discrete Fourier transform; adaptive iterative refinement algorithm; computational load; computationally efficient adaptive harmonic model; full-band adaptive harmonic model; least squares solution; speech analysis; speech signal representation; speech synthesis; Adaptation models; Atmospheric modeling; Computational modeling; Discrete Fourier transforms; Harmonic analysis; Speech; Time-frequency analysis; Fundamental frequency; harmonic model; peak picking (PP); speech analysis/synthesis; voice model;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2015.2458580