DocumentCode
3334621
Title
Similarity searching: Towards bulk-loading peer-to-peer networks
Author
Dohnal, Vlastislav ; Sedmidubský, Jan ; Zezula, Pavel ; Novák, David
Author_Institution
Fac. of Inf., Masaryk Univ., Brno
fYear
2008
fDate
7-12 April 2008
Firstpage
378
Lastpage
385
Abstract
Due to the exponential growth of digital data and its complexity, we need a technique which allows us to search such collections efficiently. A suitable solution seems to be based on the peer-to-peer (P2P) network paradigm and the metric-space model of similarity. During the building phase of the distributed structure, the peers often split as new peers join the network. During a peer split, the local data is halved and one half is migrated to the new peer. In this paper, we study the problem of efficient splits of metric data locally organized by an M-tree and we propose a novel algorithm for speeding the splits up. In particular, we focus on the metric-based structured P2P network called the M-Chord. In experimental evaluation, we compare the proposed algorithm with several straightforward solutions on a real network organizing 10 million images. Our algorithm provides a significant performance boost.
Keywords
distributed databases; peer-to-peer computing; query processing; tree data structures; very large databases; M-Chord metric-based structured P2P network; M-tree; bulk-loading peer-to-peer network paradigm; distributed structure; similarity metric-space model; similarity searching; Buildings; Costs; Data processing; Database systems; Extraterrestrial measurements; Image retrieval; Informatics; Multimedia databases; Organizing; Peer to peer computing
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Engineering Workshop, 2008. ICDEW 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-2161-9
Electronic_ISBN
978-1-4244-2162-6
Type
conf
DOI
10.1109/ICDEW.2008.4498348
Filename
4498348