DocumentCode :
705289
Title :
Robust temporal alignment of spontaneous and dubbed speech and its application for automatic dialogue replacement
Author :
Soens, Pieter ; Verhelst, Werner
Author_Institution :
Dept. ETRO-DSSP, Vrije Univ. Brussel, Brussels, Belgium
fYear :
2010
fDate :
23-27 Aug. 2010
Firstpage :
80
Lastpage :
84
Abstract :
In this paper, we present a robust system for the temporal alignment of two renditions of the same speech utterance. The system operates in two steps. During analysis, a dedicated dynamic time warping algorithm measures the timing relationships between the speech segments of the utterance that serves as a timing reference and the corresponding speech segments in the replacement utterance. The resulting warping paths are then processed and used to synthesize a high-quality speech utterance that is time-aligned with the reference. Subjective audio-visual listening tests, performed in the context of a difficult Automatic Dialogue Replacement task, demonstrated that the proposed system achieves a significant improvement over the industry-standard benchmark, both in lip-synchronization accuracy and in the overall sound quality of the synthesized utterances.
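The analysis step named in the abstract relies on dynamic time warping. The paper's "dedicated" variant is not described in this record, so the following is only a minimal sketch of the textbook algorithm, assuming one scalar feature per frame and an absolute-difference local cost. The function name `dtw_path` and both argument names are hypothetical and chosen for illustration.

```python
def dtw_path(ref, rep):
    """Textbook DTW between two non-empty 1-D feature sequences.

    ref: features of the timing reference (assumed: one float per frame)
    rep: features of the replacement utterance
    Returns (total_cost, path), where path is a list of (ref_idx, rep_idx).
    This is NOT the paper's dedicated algorithm, only the generic form.
    """
    n, m = len(ref), len(rep)
    inf = float("inf")
    # acc[i][j] = minimal accumulated cost of aligning ref[:i] with rep[:j]
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(ref[i - 1] - rep[j - 1])
            acc[i][j] = local + min(acc[i - 1][j - 1],  # both advance
                                    acc[i - 1][j],      # only ref advances
                                    acc[i][j - 1])      # only rep advances
    # Backtrack from the end of both sequences to the first frame pair.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        _, i, j = min((acc[i - 1][j - 1], i - 1, j - 1),
                      (acc[i - 1][j], i - 1, j),
                      (acc[i][j - 1], i, j - 1))
        path.append((i - 1, j - 1))
    path.reverse()
    return acc[n][m], path


# Toy example: the replacement holds the second frame one step longer.
cost, path = dtw_path([0.0, 1.0, 2.0, 3.0], [0.0, 1.0, 1.0, 2.0, 3.0])
print(cost, path)
```

In the toy example the warping path maps reference frame 1 to replacement frames 1 and 2, which is the kind of timing relationship a synthesis stage would then compensate for by time-scaling the replacement.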
Keywords :
audio-visual systems; signal synthesis; speech processing; automatic dialogue replacement; automatic dialogue replacement task; dubbed speech; dynamic time warping algorithm; high-quality speech utterance; industry-standard benchmark; lip-synchronization accuracy; replacement utterance; robust temporal alignment; speech segments; spontaneous speech; subjective audio-visual listening tests; Accuracy; Heuristic algorithms; Signal processing algorithms; Smoothing methods; Speech; Synchronization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing Conference, 2010 18th European
Conference_Location :
Aalborg
ISSN :
2219-5491
Type :
conf
Filename :
7096562