Title :
High performance indexing for massive audio fingerprint data
Author :
Ce Yu ; Runtao Wang ; Jian Xiao ; Jizhou Sun
Author_Institution :
Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
Abstract :
Real-time online audio searching systems require high performance indexing architecture for massive audio fingerprints. To improve the performance of hash table based indexing for audio fingerprints, this paper designs and evaluates a hybrid data structure which combines linked list with vector to store the values in the hash table to balance the searching performance and the memory usage. To extend the hash table to cluster environment, three distribution patterns are designed and implemented, and experiments show that the content-oriented distribution pattern is better than the keyoriented distribution pattern. The proposed serialized data layout of the hash table can further improve the searching performance with less memory usage. All the experiments are executed on practical massive data sets including up to 1,000,000 songs, and the results certificate the improvement of the methods proposed.
Keywords :
audio databases; data structures; indexing; information retrieval; pattern clustering; cluster environment; content-oriented distribution pattern; distribution patterns; hash table based indexing; high performance indexing architecture; hybrid data structure; key-oriented distribution pattern; massive audio fingerprint data; memory usage; real-time online audio searching systems; searching performance; serialized data layout; Data structures; Fingerprint recognition; Indexing; Layout; Libraries; Servers; Vectors; Audio fingerprint; hash table; high performance indexing; serialized data layout;
Journal_Title :
Consumer Electronics, IEEE Transactions on
DOI :
10.1109/TCE.2014.7027344