• DocumentCode
    3093697
  • Title

    An Efficient Cascaded Filtering Retrieval Method for Big Audio Data

  • Author

    Shanshan Yao ; Yunsheng Wang ; Baoning Niu

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Taiyuan Univ. of Technol., Taiyuan, China
  • fYear
    2015
  • fDate
    20-22 April 2015
  • Firstpage
    108
  • Lastpage
    115
  • Abstract
    Fast audio retrieval is crucial for many important applications and yet demanding due to the high dimension nature and increasingly larger volume of audios in the internet. Although audio fingerprinting can greatly reduce its dimension while keeping audio identifiable, the dimension of audio fingerprints is still too high to scale up for big audio data. The tradeoff between the accuracy and the efficiency prevents the further reducing of the dimension of fingerprints. This paper proposes a multi-stage filtering strategy for audio retrieval, with the beginning stages focusing on speed up by using a middle fingerprint with much smaller size to quickly filtering the most likely audios, and the ending stages emphasizing on accuracy by applying an accurate and robust fingerprint on the small set of the most likely audios. A notion called middle fingerprint is devised with considerable small dimension for quickly filtering out most irrelevant audios. A matching algorithm is developed to reduce the computational complexity by comparing the samples at fixed interval of two audios with thresholds. By using the middle fingerprint, audio retrieval can get a speed gain of 12 times on average compared with the Fibonacci Hashing retrieval. By combing the Fibonacci hashing algorithm with the middle filtering retrieval and the matching algorithm, we propose an efficient cascaded filtering retrieval methods, which can further improve the retrieval by 250 times on average. After applying MP3 conversion, resampling, and random shearing, the recall rates of the method are all above 99.47%, and the theoretical accuracy is close to 100%.
  • Keywords
    Big Data; audio signal processing; computational complexity; information filtering; pattern matching; signal sampling; Fibonacci hashing retrieval; MP3 conversion; audio retrieval; big audio data; cascaded filtering retrieval method; computational complexity; matching algorithm; middle filtering retrieval; middle fingerprint; multistage filtering strategy; random shearing; recall rates; resampling; Accuracy; Algorithm design and analysis; Computational efficiency; Databases; Filtering; Fingerprint recognition; Robustness; Philips audio fingerprint; audio middle fingerprint; big audio data; cascade filtering retrieval; efficient retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Big Data (BigMM), 2015 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-8687-3
  • Type

    conf

  • DOI
    10.1109/BigMM.2015.45
  • Filename
    7153863