DocumentCode
1693540
Title
A fast table lookup based, statistical model driven non-uniform unit selection TTS
Author
Yao Qian ; Soong, Frank K. ; Xiaobo Zhou ; Yundi Qian ; Xiaotian Zhang
Author_Institution
Microsoft Res. Asia, Beijing, China
fYear
2013
Firstpage
7957
Lastpage
7961
Abstract
For multi-channel TTS applications, e.g. in a cloud service, it is highly desirable that high quality speech can be synthesized in low complexity. In this paper, we propose a fast table lookup based, statistical model driven approach to non-uniform unit selection TTS for that purpose. In TTS training, the voice font of all waveform segments is organized as a Gaussian kernel coded hash table and a table for storing quantized costs of all possible concatenation segment pairs. In synthesis, waveform segments with non-uniform lengths are first selected to construct a candidate lattice by looking up the Gaussian kernel coded hash table, and the best path is searched in the lattice by minimizing the accumulated concatenation scores, which are retrieved from the quantization table for possible concatenations. Experimental results show that the new approach can significantly reduce the search complexity while keep a high TTS voice quality.
Keywords
Gaussian distribution; hidden Markov models; quantisation (signal); speech coding; speech synthesis; table lookup; Gaussian kernel code hash table; TTS training; concatenation segment pair; fast table lookup; high quality speech; multichannel TTS application; nonuniform unit selection TTS; quantization table; quantized cost; statistical model; text-to-speech system; waveform segment; Hidden Markov models; Kernel; Lattices; Speech; Speech synthesis; Training; Trajectory; Hybrid TTS; fast TTS; statistical parametric synthesis; unit-selection based TTS; voice font quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639214
Filename
6639214
Link To Document