DocumentCode
705289
Title
Robust temporal alignment of spontaneous and dubbed speech and its application for automatic dialogue replacement
Author
Soens, Pieter ; Verhelst, Werner
Author_Institution
Dept. ETRO-DSSP, Vrije Univ. Brussel, Brussels, Belgium
fYear
2010
fDate
23-27 Aug. 2010
Firstpage
80
Lastpage
84
Abstract
In this paper, we present a robust system for the temporal alignment of 2 renditions of the same speech utterance. The system operates in 2 steps: during analysis, the timing relationships between the speech segments of the utterance that serves as a timing reference and the corresponding speech segments in the replacement utterance are measured by means of a dedicated dynamic time warping algorithm. The obtained warping paths are then processed and used to synthesize a high-quality speech utterance that is time-aligned with the reference. Subjective audio-visual listening tests performed within the context of a difficult Automatic Dialogue Replacement task demonstrated that the proposed system achieves a significant improvement compared to the industry-standard benchmark, both in terms of achieved lip-synchronization accuracy as well as in overall sound quality of the synthesized utterances.
Keywords
audio-visual systems; signal synthesis; speech processing; automatic dialogue replacement; automatic dialogue replacement task; dubbed speech; dynamic time warping algorithm; high-quality speech utterance; industry-standard benchmark; lip-synchronization accuracy; replacement utterance; robust temporal alignment; speech segments; spontaneous speech; subjective audio-visual listening tests; Accuracy; Heuristic algorithms; Signal processing algorithms; Smoothing methods; Speech; Synchronization;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2010 18th European
Conference_Location
Aalborg
ISSN
2219-5491
Type
conf
Filename
7096562
Link To Document