• DocumentCode
    730695
  • Title

    Modelling acoustic feature dependencies with artificial neural networks: Trajectory-RNADE

  • Author

    Uria, Benigno ; Murray, Iain ; Renals, Steve ; Valentini-Botinhao, Cassia ; Bridle, John

  • Author_Institution
    Inst. for Adaptive & Neural Comput., Univ. of Edinburgh, Edinburgh, UK
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4465
  • Lastpage
    4469
  • Abstract
    Given a transcription, sampling from a good model of acoustic feature trajectories should result in plausible realizations of an utterance. However, samples from current probabilistic speech synthesis systems result in low quality synthetic speech. Henter et al. have demonstrated the need to capture the dependencies between acoustic features conditioned on the phonetic labels in order to obtain high quality synthetic speech. These dependencies are often ignored in neural network based acoustic models. We tackle this deficiency by introducing a probabilistic neural network model of acoustic trajectories, trajectory RNADE, able to capture these dependencies.
  • Keywords
    acoustic signal processing; neural nets; speech synthesis; acoustic feature dependencies; artificial neural networks; phonetic labels; probabilistic speech synthesis; trajectory RNADE; utterance; Acoustics; Computational modeling; Europe; Hidden Markov models; Joints; Probabilistic logic; RNADE; Speech synthesis; acoustic modelling; artificial neural networks; trajectory model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178815
  • Filename
    7178815