• DocumentCode
    1687431
  • Title

    A style capturing approach to F0 transformation in voice conversion

  • Author

    Krishna Anumanchipalli, Gopala ; Oliveira, Luis C. ; Black, Alan W.

  • Author_Institution
    Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2013
  • Firstpage
    6915
  • Lastpage
    6919
  • Abstract
    In this paper, we present a new approach to F0 transformation, that can capture aspects of speaking style. Instead of using the traditional 5ms frames as units in transformation, we propose a method that looks at longer phonological regions such as metrical feet. We automatically detect metrical feet in the source speech, and for each of source speaker´s feet, we find its phonological correspondence in target speech. We use a statistical phrase accent model to represent the F0 contour, where a 4-dimensional TILT representation is used for the F0 is parameterized over each feet region for the source and target speakers. This forms the parallel data that is the training data for our transformation. We transform the phrase component using simple z-score mapping. We use a joint density Gaussian mixture model to transform the accent contours. Our transformation method generates F0 contours that are significantly more correlated with the target speech than a baseline, frame-based method.
  • Keywords
    Gaussian processes; speaker recognition; statistical analysis; 4-dimensional TILT representation; F0 contour representation; F0 transformation; accent contour transformation; automatic metrical feet detection; joint density Gaussian mixture model; parallel data; phonological correspondence; phonological regions; phrase component; source speaker feet; source speech; speaking style; statistical phrase accent model; style capturing approach; training data; voice conversion; z-score mapping; Correlation; Foot; Shape; Speech; Stress; Transforms; Vectors; F0; Metrical Foot; Prosody Transformation; Voice Conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6639002
  • Filename
    6639002