• DocumentCode
    705289
  • Title
    Robust temporal alignment of spontaneous and dubbed speech and its application for automatic dialogue replacement
  • Author
    Soens, Pieter; Verhelst, Werner
  • Author_Institution
    Dept. ETRO-DSSP, Vrije Univ. Brussel, Brussels, Belgium
  • fYear
    2010
  • fDate
    23-27 Aug. 2010
  • Firstpage
    80
  • Lastpage
    84
  • Abstract
    In this paper, we present a robust system for the temporal alignment of two renditions of the same speech utterance. The system operates in two steps. During analysis, a dedicated dynamic time warping algorithm measures the timing relationships between the speech segments of the utterance that serves as the timing reference and the corresponding speech segments of the replacement utterance. The resulting warping paths are then processed and used to synthesize a high-quality speech utterance that is time-aligned with the reference. Subjective audio-visual listening tests, performed in the context of a difficult Automatic Dialogue Replacement task, demonstrated that the proposed system achieves a significant improvement over the industry-standard benchmark, both in lip-synchronization accuracy and in the overall sound quality of the synthesized utterances.
  • Keywords
    audio-visual systems; signal synthesis; speech processing; automatic dialogue replacement; automatic dialogue replacement task; dubbed speech; dynamic time warping algorithm; high-quality speech utterance; industry-standard benchmark; lip-synchronization accuracy; replacement utterance; robust temporal alignment; speech segments; spontaneous speech; subjective audio-visual listening tests; Accuracy; Heuristic algorithms; Signal processing algorithms; Smoothing methods; Speech; Synchronization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2010 18th European Signal Processing Conference
  • Conference_Location
    Aalborg
  • ISSN
    2219-5491
  • Type
    conf
  • Filename
    7096562
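
The analysis step described in the abstract relies on dynamic time warping (DTW). As a rough illustration only, the sketch below implements textbook DTW on two sequences of scalar features and recovers the warping path by backtracking. It is not the authors' dedicated algorithm: the function name `dtw_path`, the scalar features, and the absolute-difference local cost are all assumptions made for this example, and the synthesis step is not covered.

```python
from math import inf


def dtw_path(ref, rep):
    """Textbook DTW between two 1-D feature sequences (illustrative only).

    ref: features of the timing reference (hypothetical scalar features)
    rep: features of the replacement utterance
    Returns (accumulated_cost, path), where path is a list of (i, j) index pairs.
    """
    n, m = len(ref), len(rep)
    # cost[i][j] = minimal accumulated distance aligning ref[:i] with rep[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(ref[i - 1] - rep[j - 1])  # assumed local distance
            cost[i][j] = d + min(cost[i - 1][j - 1],  # diagonal step
                                 cost[i - 1][j],      # advance ref only
                                 cost[i][j - 1])      # advance rep only

    # Backtrack from the end of both sequences to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min((cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1))
    path.reverse()
    return cost[n][m], path


# Toy example: the replacement lingers on some values of the reference.
ref = [0, 1, 2, 3]
rep = [0, 0, 1, 2, 2, 3]
total, path = dtw_path(ref, rep)
```

Each pair `(i, j)` in `path` states that frame `i` of the reference corresponds to frame `j` of the replacement. A time-scaling stage could use such a mapping to stretch or compress the replacement, which is the role the abstract assigns to the processed warping paths.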