• DocumentCode
    2178593
  • Title

    Very low bit-rate F0 coding for phonetic vocoder using MSD-HMM with quantized F0 context

  • Author

    Nose, Takashi ; Kobayashi, Takao

  • Author_Institution
    Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5236
  • Lastpage
    5239
  • Abstract
    This paper presents a very low bit-rate F0 coding technique for speaker-dependent phonetic vocoder based on hidden Markov model (HMM) using quantized F0 context. In the proposed technique, the input F0 sequence is converted into F0 symbol sequence at a phoneme level using scalar quantization. The quantized F0 symbols are used in the decoding process as the prosodic context for the HMM-based speech synthesis. The synthetic speech is generated from the context-dependent labels and input speaker´s pre-trained HMMs by using the HMM-based parameter generation algorithm. By taking account account of preceding and succeeding phonemes and F0 symbols as the contextual factors, we can generate smooth F0 trajectory similar to that of the original with only a small number of quantization bits. Experimental results demonstrate that the proposed technique can generate F0 contour with acceptable quality even when the bit-rate is less than 50 bps.
  • Keywords
    hidden Markov models; speech synthesis; vocoders; HMM-based parameter generation algorithm; HMM-based speech synthesis; MSD-HMM; hidden Markov model; quantized F0 context; scalar quantization; speaker-dependent phonetic vocoder; very low bit-rate F0 coding; Context; Hidden Markov models; Quantization; Speech; Speech coding; Speech recognition; F0 context; HMM-based speech synthesis; multi-space distribution HMM; phonetic vocoder; very low bit-rate speech coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947538
  • Filename
    5947538