• DocumentCode
    2452192
  • Title

    Audio fingerprint based on Spectral Flux for audio retrieval

  • Author

    Wengen Wang ; Xiaoqing Yu ; Yun Hui Wang ; Swaminathan, R.

  • Author_Institution
    School of Commun. & Inf. Eng., Shanghai Univ., Shanghai, China
  • fYear
    2012
  • fDate
    16-18 July 2012
  • Firstpage
    1104
  • Lastpage
    1107
  • Abstract
    In audio fingerprinting, an audio clip must be recognized by matching an extracted fingerprint to a database of previously computed fingerprints. The fingerprints should reduce the dimensionality of the input significantly, provide discrimination among different audio clips, and, at the same time, be invariant to distorted versions of the same audio clip. In this paper, we design fingerprints addressing the above issues by extracting the audio fingerprints from the Spectral Flux of the clipped signal. Spectral Flux (SF) is a measure of how quickly the power spectrum of a signal is changing, calculated by comparing the power spectrum of one frame against the power spectrum of the previous frame. More precisely, it is usually calculated as the 2-norm (also known as the Euclidean distance) between the two normalised spectra. Using the audio fingerprint (AF) as the feature of our algorithm, we retrieve audio clips from a database that stores previously computed fingerprints. We test the robustness of the fingerprints under a large number of distortions. The experimental results show that the proposed algorithm performs well in audio retrieval.
  • Keywords
    audio databases; audio signal processing; feature extraction; information retrieval; spectral analysis; 2-norm; Euclidean distance; SF; audio clip recognition; audio fingerprint; audio retrieval; dimensionality reduction; fingerprint extraction matching; frame power spectrum; spectral flux; Algorithm design and analysis; Audio recording; Databases; Fingerprint recognition; Noise; Robustness; Signal processing algorithms;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2012 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0173-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2012.6376781
  • Filename
    6376781
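
The spectral flux computation described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the frame length and hop size, the unit-sum normalisation, and the naive DFT are all assumptions made for the example. The record does not specify them, nor does it say how the fingerprint is derived from the flux values.

```python
import cmath
import math


def power_spectrum(frame):
    """Power spectrum |X[k]|^2 of one real-valued frame (bins 0..N/2), via a naive DFT."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def normalise(spectrum):
    """Scale a spectrum to unit sum (assumed normalisation); silent frames are left as-is."""
    total = sum(spectrum)
    return [v / total for v in spectrum] if total > 0 else list(spectrum)


def spectral_flux(signal, frame_len=64, hop=32):
    """Return one flux value per pair of consecutive frames.

    Each value is the 2-norm (Euclidean distance) between the normalised
    power spectra of a frame and the frame before it.
    frame_len and hop are illustrative defaults, not values from the paper.
    """
    starts = range(0, len(signal) - frame_len + 1, hop)
    spectra = [normalise(power_spectrum(signal[s:s + frame_len])) for s in starts]
    return [
        math.sqrt(sum((c - p) ** 2 for p, c in zip(prev, cur)))
        for prev, cur in zip(spectra, spectra[1:])
    ]
```

For a stationary tone the flux stays near zero, while a change in frequency content between consecutive frames produces a clearly larger value.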