DocumentCode :
23087
Title :
An Efficient Cascaded Filtering Retrieval Method for Big Audio Data
Author :
Shanshan Yao ; Yunsheng Wang ; Baoning Niu
Author_Institution :
Dept. of Comput. Sci. & Technol., Taiyuan Univ. of Technol., Taiyuan, China
Volume :
17
Issue :
9
fYear :
2015
fDate :
Sept. 2015
Firstpage :
1450
Lastpage :
1459
Abstract :
Fast audio retrieval is crucial for many important applications and yet demanding due to the high dimension nature and increasingly larger volume of audios on the Internet. Although audio fingerprinting can greatly reduce its dimension while keeping audio identifiable, the dimension for audio fingerprints is still too high to scale up for big audio data. The tradeoff between accuracy (measured by precision and recall rate) and efficiency (measured by retrieval time) prevents further reduction in the dimension of fingerprints. This paper shows that a multi-stage filtering strategy can achieve both speedup and high accuracy, with the beginning stages focusing on speedup and the end stage emphasizing accuracy. With this strategy, an efficient cascaded filtering retrieval method is proposed that consists of filtering with Fibonacci hashing, the middle fingerprint, thresholds to quickly select candidate audios, and refining with an accurate and robust fingerprint on the candidate audios. Experiments with 500 000 audios show that the proposed method can achieve a speed gain more than 28 K times that of the Fibonacci hashing retrieval. After applying MP3 conversion, resampling, white noise addition, and background noise addition, the recall rates of the method are all above 99.45%, and the precision is the same as the Philips audio fingerprint, which is close to 100%.
Keywords :
Internet; audio signal processing; filtering theory; information retrieval; Fibonacci hashing retrieval; Internet; MP3 conversion; audio fingerprinting; big audio data; efficient cascaded filtering retrieval method; multistage filtering strategy; Accuracy; Algorithm design and analysis; Audio databases; Filtering; Fingerprint recognition; Robustness; Audio middle fingerprint; Philips audio fingerprint; big audio data; cascade filtering retrieval; content-based retrieval;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2015.2460121
Filename :
7165676
Link To Document :
بازگشت