DocumentCode
3093697
Title
An Efficient Cascaded Filtering Retrieval Method for Big Audio Data
Author
Shanshan Yao ; Yunsheng Wang ; Baoning Niu
Author_Institution
Coll. of Comput. Sci. & Technol., Taiyuan Univ. of Technol., Taiyuan, China
fYear
2015
fDate
20-22 April 2015
Firstpage
108
Lastpage
115
Abstract
Fast audio retrieval is crucial for many important applications and yet demanding due to the high dimension nature and increasingly larger volume of audios in the internet. Although audio fingerprinting can greatly reduce its dimension while keeping audio identifiable, the dimension of audio fingerprints is still too high to scale up for big audio data. The tradeoff between the accuracy and the efficiency prevents the further reducing of the dimension of fingerprints. This paper proposes a multi-stage filtering strategy for audio retrieval, with the beginning stages focusing on speed up by using a middle fingerprint with much smaller size to quickly filtering the most likely audios, and the ending stages emphasizing on accuracy by applying an accurate and robust fingerprint on the small set of the most likely audios. A notion called middle fingerprint is devised with considerable small dimension for quickly filtering out most irrelevant audios. A matching algorithm is developed to reduce the computational complexity by comparing the samples at fixed interval of two audios with thresholds. By using the middle fingerprint, audio retrieval can get a speed gain of 12 times on average compared with the Fibonacci Hashing retrieval. By combing the Fibonacci hashing algorithm with the middle filtering retrieval and the matching algorithm, we propose an efficient cascaded filtering retrieval methods, which can further improve the retrieval by 250 times on average. After applying MP3 conversion, resampling, and random shearing, the recall rates of the method are all above 99.47%, and the theoretical accuracy is close to 100%.
Keywords
Big Data; audio signal processing; computational complexity; information filtering; pattern matching; signal sampling; Fibonacci hashing retrieval; MP3 conversion; audio retrieval; big audio data; cascaded filtering retrieval method; computational complexity; matching algorithm; middle filtering retrieval; middle fingerprint; multistage filtering strategy; random shearing; recall rates; resampling; Accuracy; Algorithm design and analysis; Computational efficiency; Databases; Filtering; Fingerprint recognition; Robustness; Philips audio fingerprint; audio middle fingerprint; big audio data; cascade filtering retrieval; efficient retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Big Data (BigMM), 2015 IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-8687-3
Type
conf
DOI
10.1109/BigMM.2015.45
Filename
7153863
Link To Document