Title :
Audio-visual synchronization recovery in multimedia content
Author :
Lee, Jong-Seok ; Ebrahimi, Touradj
Author_Institution :
Multimedia Signal Process. Group (MMSPG), Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
Abstract :
This paper proposes a method recovering audio-visual synchronization of multimedia content. It exploits the correlation between the acoustic and the visual signals in order to estimate the audio-visual drift existing in the content. By shifting the audio signal relative to the visual signal, the estimation of the drift is obtained by searching for the shift producing the maximal audio-visual correlation. We consider two correlation measures, namely, mutual information and canonical correlation, and compare their performance. Experimental results demonstrate that the method using the canonical correlation is effective in recovering the audio-visual synchronization for both speech and non-speech sequences.
Keywords :
audio-visual systems; multimedia systems; synchronisation; audio-visual synchronization recovery; multimedia content; speech sequence; Acoustics; Correlation; Estimation; Multimedia communication; Speech; Synchronization; Visualization; Audio-visual synchronization; canonical correlation; multimedia; mutual information;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946937