• DocumentCode
    1723829
  • Title

    Voice conversion based on empirical conditional distribution in resource-limited scenarios

  • Author

    Ning Xu ; Yibin Tang ; Jingyi Bao ; Xiao Yao ; Aimin Jiang ; Xiaofeng Liu

  • Author_Institution
    Coll. of IoT Eng., Hohai Univ., Changzhou, China
  • fYear
    2015
  • Firstpage
    172
  • Lastpage
    173
  • Abstract
    In this paper, a computationally efficient voice conversion system has been designed in order to improve the performance in resource-limited scenarios. First, mixtures of Gaussians (MoGs) at fixed locations of Mel frequencies have been used to represent the spectrum of STRAIGHT compactly. Second, the key conditional distributions for prediction are approximated by building histograms of aligned features empirically. Experiments have confirmed that our proposed method can obtain fairly good results compared to the traditional method without huge computational costs.
  • Keywords
    Gaussian processes; speech enhancement; Gaussians mixtures; Mel frequencies; MoGs; STRAIGHT compactly spectrum; empirical conditional distribution; mixtures of Gaussians; voice conversion system; Buildings; Computational efficiency; Computational modeling; Feature extraction; Histograms; Speech; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Consumer Electronics - Taiwan (ICCE-TW), 2015 IEEE International Conference on
  • Conference_Location
    Taipei
  • Type

    conf

  • DOI
    10.1109/ICCE-TW.2015.7216839
  • Filename
    7216839