DocumentCode :
3717469
Title :
Indexing media storms on Flink
Author :
Dimitrios Rafailidis;Stefanos Antaris
Author_Institution :
Department of Informatics, Aristotle University of Thessaloniki
fYear :
2015
Firstpage :
2836
Lastpage :
2838
Abstract :
We propose a media storm indexing algorithm using Map-Reduce in our recently proposed CDVC framework. In this study, CDVC is built on Flink, an open-source platform for stream data processing. The question we answer is how to store massive image collections; for instance, with over one million images per second, as well as with varying incoming rate. In our experiments with two benchmark datasets of 80M and 1B image descriptors, we evaluate the proposed algorithm on different indexing workloads, that is, images that come with high volume and different velocity at the scale of 105-106 images per second. Using a limited set of computational nodes, we show that we achieve a significant speed up factor of nine, on average, compared to conventional indexing techniques, in all settings. Finally, we make our source code publicly available.
Keywords :
"Indexing","Media","Storms","Gaussian distribution","Standards","Big data"
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BigData.2015.7364094
Filename :
7364094
Link To Document :
بازگشت