• DocumentCode
    417102
  • Title

    Non-parallel training for voice conversion by maximum likelihood constrained adaptation

  • Author

    Mouchtaris, Athanasios ; Van der Spiegel, Jan ; Mueller, Paul

  • Author_Institution
    Dept. of Electr. & Syst. Eng., Pennsylvania Univ., Philadelphia, PA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The objective of voice conversion methods is to modify the speech characteristics of a particular speaker in such manner, as to sound like speech by a different target speaker. Current voice conversion algorithms are based on deriving a conversion function by estimating its parameters through a corpus that contains the same utterances spoken by both speakers. Such a corpus, usually referred to as a parallel corpus, has the disadvantage that many times it is difficult or even impossible to collect. Here, we propose a voice conversion method that does not require a parallel corpus for training, i.e. the spoken utterances by the two speakers need not be the same, by employing speaker adaptation techniques to adapt to a particular pair of source and target speakers, the derived conversion parameters from a different pair of speakers. We show that adaptation reduces the error obtained when simply applying the conversion parameters of one pair of speakers to another by a factor that can reach 30% in many cases, and with performance comparable with the ideal case when a parallel corpus is available.
  • Keywords
    maximum likelihood estimation; speaker recognition; speech processing; maximum likelihood constrained adaptation; nonparallel training; performance; speaker adaptation techniques; spoken utterances; voice conversion; Acoustical engineering; Loudspeakers; Maximum likelihood estimation; Parameter estimation; Signal design; Speech synthesis; Systems engineering and theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325907
  • Filename
    1325907