Title :
Modeling the unvoiced component in the canonical representation of speech
Author :
Ramírez, Miguel Arjona
Author_Institution :
Escola Politec., Univ. of Sao Paulo, Sao Paulo, Brazil
Abstract :
The canonical representation of speech constitutes a perfect reconstruction (PR) analysis-synthesis system. Its parameters are the autoregressive (AR) model coefficients, the pitch period and the voiced and unvoiced components of the excitation represented as transform coefficients. Each set of parameters may be operated on independently. A time-frequency unvoiced excitation (TFUNEX) model is proposed that has high time resolution and selective frequency resolution. Improved time-frequency fit is obtained by using for antialiasing cancellation the clustering of pitch-synchronous transform tracks defined in the modulation transform domain. The TFUNEX model delivers high-quality speech while compressing the unvoiced excitation representation about 13 times over its raw transform coefficient representation for wideband speech.
Keywords :
autoregressive processes; data compression; pattern clustering; signal reconstruction; signal representation; speech coding; time-frequency analysis; antialiasing cancellation; autoregressive model coefficient; high time resolution; modulation transform domain; perfect reconstruction analysis-synthesis system; pitch-synchronous transform clustering; selective frequency resolution; speech canonical representation; time-frequency unvoiced excitation model; transform coefficient; unvoiced excitation representation compression; Bit rate; Independent component analysis; Modulation coding; Speech analysis; Speech coding; Speech enhancement; Speech synthesis; Stochastic processes; Time frequency analysis; Wideband; modulation transform; scalable coding; speech analysis; speech coding; time-frequency analysis;
Conference_Titel :
Digital Signal Processing, 2009 16th International Conference on
Conference_Location :
Santorini-Hellas
Print_ISBN :
978-1-4244-3297-4
Electronic_ISBN :
978-1-4244-3298-1
DOI :
10.1109/ICDSP.2009.5201232