Title :
Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals
Author :
Kawahara, Hideki ; Morise, Masanori ; Toda, Tomoki ; Banneo, Hideki ; Nisimura, Ryuichi ; Irino, Toshio
Author_Institution :
Fac. of Syst. Eng., Wakayama Univ., Wakayama, Japan
Abstract :
A new group delay representation, which yields value zero for periodic signals irrespective to the initial phase and the relative level of each harmonic component. This new group delay representation provides a unified basis for defining "aperiodicity" in speech sounds. For example, the periodic to noise ratio or harmonic to noise ratio is directly derived from the deviation of this group delay representation from value zero, after removing FM effects of harmonic frequencies and removing AM effects of harmonic component level. The derived deviation is combined with estimated excitation duration information and used to design aperiodic components of excitation source for high-quality synthetic speech. The proposed group delay representation is based on FO-adaptive weighted average of frequency shifted versions and temporally shifted versions of group delays with power spectral weighting.
Keywords :
speech processing; AM effects; FM effects; frequency shifted versions; harmonic component level; harmonic frequencies; high-quality speech manipulation systems; noise ratio harmonic; periodic signals; power spectral weighting; source design excitation; speech sounds; temporally static group delay representation; Delays; Educational institutions; Equations; Harmonic analysis; Mathematical model; Spectral analysis; Speech;
Conference_Titel :
Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)
Conference_Location :
Siem Reap
DOI :
10.1109/APSIPA.2014.7041594