Title :
Speech synthesis using HMM based diphone inventory encoding for low-resource devices
Author :
Strecha, Guntram ; Wolff, Matthias
Author_Institution :
Tech. Univ. Dresden, Dresden, Germany
Abstract :
In this paper we describe the compression of diphone inventories used by the acoustic synthesis of a concatenative synthesis system. The inventory compression is based on a codebook drawn from the Gaussian mean vectors of phoneme HMMs. There are two encoding/synthesis schemes, a speaker dependent and a speaker independent one. The advantage of the latter is the potential common use of the HM-models by a recognizer and a synthesizer. We describe the steps to encode the inventories as well as the acoustic synthesis using them. Using the proposed method a diphone inventory with 1175 units can be compressed down to 19 kB. We will show that the synthesis quality with HMM-encoded inventories matches the quality of synthesis with AMRor SPEEX-encoded inventories at noticeably smaller inventory sizes.
Keywords :
Gaussian processes; data compression; hidden Markov models; speech coding; speech synthesis; vectors; AMR-encoded inventory; Gaussian mean vector; HMM based diphone inventory encoding; SPEEX-encoded inventory; acoustic synthesis; codebook; concatenative synthesis system; diphone inventory compression; low-resource device; phoneme HMM; speaker dependent; speaker independent; speech synthesis; Encoding; Hidden Markov models; Indexes; Speech; Speech recognition; Training; Hidden Markov Models; Speech coding; Speech synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947574