• DocumentCode
    1693540
  • Title

    A fast table lookup based, statistical model driven non-uniform unit selection TTS

  • Author

    Yao Qian ; Soong, Frank K. ; Xiaobo Zhou ; Yundi Qian ; Xiaotian Zhang

  • Author_Institution
    Microsoft Res. Asia, Beijing, China
  • fYear
    2013
  • Firstpage
    7957
  • Lastpage
    7961
  • Abstract
    For multi-channel TTS applications, e.g. in a cloud service, it is highly desirable that high quality speech can be synthesized in low complexity. In this paper, we propose a fast table lookup based, statistical model driven approach to non-uniform unit selection TTS for that purpose. In TTS training, the voice font of all waveform segments is organized as a Gaussian kernel coded hash table and a table for storing quantized costs of all possible concatenation segment pairs. In synthesis, waveform segments with non-uniform lengths are first selected to construct a candidate lattice by looking up the Gaussian kernel coded hash table, and the best path is searched in the lattice by minimizing the accumulated concatenation scores, which are retrieved from the quantization table for possible concatenations. Experimental results show that the new approach can significantly reduce the search complexity while keep a high TTS voice quality.
  • Keywords
    Gaussian distribution; hidden Markov models; quantisation (signal); speech coding; speech synthesis; table lookup; Gaussian kernel code hash table; TTS training; concatenation segment pair; fast table lookup; high quality speech; multichannel TTS application; nonuniform unit selection TTS; quantization table; quantized cost; statistical model; text-to-speech system; waveform segment; Hidden Markov models; Kernel; Lattices; Speech; Speech synthesis; Training; Trajectory; Hybrid TTS; fast TTS; statistical parametric synthesis; unit-selection based TTS; voice font quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639214
  • Filename
    6639214