  • DocumentCode
    3526317
  • Title
    Application of voice conversion for cross-language rap singing transformation
  • Author
    Turk, Oytun ; Buyuk, Osman ; Haznedaroglu, Ali ; Arslan, Levent M.
  • Author_Institution
    Speech Group, DFKI GmbH Language Technol. Lab., Berlin
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    3597
  • Lastpage
    3600
  • Abstract
    Voice conversion enables generation of a desired speaker's voice from audio recordings of another speaker. In this paper, we focus on a music application and describe the first steps towards generating voices of music celebrities using conventional voice conversion techniques. Specifically, rap singing transformations from English to Spanish are performed using parallel training material in English. Weighted codebook mapping based voice conversion with two different alignment methods and temporal smoothing of the transformation filter are employed. The first aligner uses an HMM trained for each source recording to force-align the corresponding target recording. The second aligner employs speaker-independent HMMs trained from a large number of speakers. Additionally, a smoothing step is devised to reduce discontinuities and to improve performance. The results of subjective evaluations indicate that both aligners perform equivalently well. The proposed smoothing technique improves both similarity to target singer and quality significantly regardless of the alignment method.
  • Keywords
    acoustic signal processing; filtering theory; hidden Markov models; smoothing methods; English; HMM; Spanish; alignment methods; audio recordings; cross-language rap singing transformation; music; parallel training material; speaker voice; temporal smoothing; transformation filter; voice conversion techniques; weighted codebook mapping; Audio recording; Filters; Hidden Markov models; Performance evaluation; Smoothing methods; singing voice transformation; voice conversion; weighted codebook mapping;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2009.4960404
  • Filename
    4960404