DocumentCode
730089
Title
A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds
Author
Rafii, Zafar ; Liutkus, Antoine ; Pardo, Bryan
Author_Institution
Gracenote, Media Technol. Lab., Emeryville, CA, USA
fYear
2015
fDate
19-24 April 2015
Firstpage
271
Lastpage
275
Abstract
Repetition is a fundamental element in generating and perceiving structure in audio. Especially in music, structures tend to be composed of patterns that repeat through time (e.g., rhythmic elements in a musical accompaniment), and also frequency (e.g., different notes of the same instrument). The auditory system has the remarkable ability to parse such patterns by identifying repetitions within the audio mixture. On this basis, we propose a simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds. A user selects a region in the log-frequency spectrogram of an audio recording from which she/he wishes to recover a repeating pattern masked by an undesired element (e.g., a note masked by a cough). The selected region is then cross-correlated with the spectrogram to identify similar regions where the underlying pattern repeats. The identified regions are finally averaged over their repetitions and the repeating pattern is recovered.
Keywords
audio signal processing; graphical user interfaces; speech processing; audio mixture; audio recording; functional graphical user interface; log-frequency spectrogram; median filter; pattern recovery; simple user interface system; Audio recording; Noise; Source separation; Spectrogram; Speech; Time-frequency analysis; Transforms; Constant Q Transform; audio source separation; median filter; normalized 2-d cross-correlation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7177974
Filename
7177974
Link To Document