DocumentCode
177633
Title
An audio fingerprinting system for live version identification using image processing techniques
Author
Rafii, Zafar ; Coover, Bob ; Jinyu Han
Author_Institution
Northwestern Univ. Evanston, Evanston, IL, USA
fYear
2014
fDate
4-9 May 2014
Firstpage
644
Lastpage
648
Abstract
Suppose that you are at a music festival checking on an artist, and you would like to quickly know about the song that is being played (e.g., title, lyrics, album, etc.). If you have a smartphone, you could record a sample of the live performance and compare it against a database of existing recordings from the artist. Services such as Shazam or SoundHound will not work here, as this is not the typical framework for audio fingerprinting or query-by-humming systems, as a live performance is neither identical to its studio version (e.g., variations in instrumentation, key, tempo, etc.) nor it is a hummed or sung melody. We propose an audio fingerprinting system that can deal with live version identification by using image processing techniques. Compact fingerprints are derived using a log-frequency spectrogram and an adaptive thresholding method, and template matching is performed using the Hamming similarity and the Hough Transform.
Keywords
Hough transforms; audio signal processing; fingerprint identification; image segmentation; Hamming similarity; Hough Transform; adaptive thresholding method; audio fingerprinting system; compact fingerprints; image processing techniques; live version identification; log-frequency spectrogram; music festival; smartphone; template matching; Degradation; Robustness; Spectrogram; Speech; Speech processing; Time-frequency analysis; Transforms; Adaptive thresholding; Constant Q Transform; audio fingerprinting; cover identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6853675
Filename
6853675
Link To Document