DocumentCode
3153756
Title
Clustering and synchronizing multi-camera video via landmark cross-correlation
Author
Bryan, Nicholas J. ; Smaragdis, Paris ; Mysore, Gautham J.
Author_Institution
Center for Computer Research in Music and Acoustics, Stanford University, Stanford, CA, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
2389
Lastpage
2392
Abstract
We propose a method to both identify and synchronize multi-camera video recordings within a large collection of video and/or audio files. Landmark-based audio fingerprinting is used to match multiple recordings of the same event together and time-synchronize each file within the groups. Compared to prior work, we offer improvements towards event identification and a new synchronization refinement method that resolves inconsistent estimates and allows non-overlapping content to be synchronized within larger groups of recordings. Furthermore, the audio fingerprinting-based synchronization is shown to be equivalent to an efficient and scalable time-difference-of-arrival method using cross-correlation performed on a non-linearly transformed signal.
Keywords
audio coding; direction-of-arrival estimation; estimation theory; pattern clustering; synchronisation; video cameras; video signal processing; audio files; audio fingerprinting-based synchronization; event identification; inconsistent estimates; landmark cross-correlation; landmark-based audio fingerprinting; multicamera video clustering; multicamera video recordings; multicamera video synchronizing; multiple recordings; nonlinearly transformed signal; nonoverlapping content; synchronization refinement method; time-difference-of-arrival method; video files; Correlation; Estimation; Feature extraction; Speech; Synchronization; Time difference of arrival; Time frequency analysis; Video and audio synchronization; audio fingerprinting
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288396
Filename
6288396