An Efficient Cascaded Filtering Retrieval Method for Big Audio Data

Author

Shanshan Yao ; Yunsheng Wang ; Baoning Niu

Author_Institution

Dept. of Comput. Sci. & Technol., Taiyuan Univ. of Technol., Taiyuan, China

Volume

17

Issue

9

fYear

2015

fDate

Sept. 2015

Firstpage

1450

Lastpage

1459

Abstract

Fast audio retrieval is crucial for many important applications and yet demanding due to the high dimension nature and increasingly larger volume of audios on the Internet. Although audio fingerprinting can greatly reduce its dimension while keeping audio identifiable, the dimension for audio fingerprints is still too high to scale up for big audio data. The tradeoff between accuracy (measured by precision and recall rate) and efficiency (measured by retrieval time) prevents further reduction in the dimension of fingerprints. This paper shows that a multi-stage filtering strategy can achieve both speedup and high accuracy, with the beginning stages focusing on speedup and the end stage emphasizing accuracy. With this strategy, an efficient cascaded filtering retrieval method is proposed that consists of filtering with Fibonacci hashing, the middle fingerprint, thresholds to quickly select candidate audios, and refining with an accurate and robust fingerprint on the candidate audios. Experiments with 500 000 audios show that the proposed method can achieve a speed gain more than 28 K times that of the Fibonacci hashing retrieval. After applying MP3 conversion, resampling, white noise addition, and background noise addition, the recall rates of the method are all above 99.45%, and the precision is the same as the Philips audio fingerprint, which is close to 100%.

Keywords

Internet; audio signal processing; filtering theory; information retrieval; Fibonacci hashing retrieval; Internet; MP3 conversion; audio fingerprinting; big audio data; efficient cascaded filtering retrieval method; multistage filtering strategy; Accuracy; Algorithm design and analysis; Audio databases; Filtering; Fingerprint recognition; Robustness; Audio middle fingerprint; Philips audio fingerprint; big audio data; cascade filtering retrieval; content-based retrieval;

fLanguage

English

Journal_Title

Multimedia, IEEE Transactions on

Publisher

ieee

ISSN

1520-9210

Type

jour

DOI

10.1109/TMM.2015.2460121

Filename

7165676