Title :
A simple music/voice separation method based on the extraction of the repeating musical structure
Author :
Rafii, Zafar ; Pardo, Bryan
Author_Institution :
EECS Dept., Northwestern Univ., Evanston, IL, USA
Abstract :
Repetition is a core principle in music. This is especially true for popular songs, generally marked by a noticeable repeating musical structure, over which the singer performs varying lyrics. On this basis, we propose a simple method for separating music and voice, by extraction of the repeating musical structure. First, the period of the repeating structure is found. Then, the spectrogram is segmented at period boundaries and the segments are averaged to create a repeating segment model. Finally, each time-frequency bin in a segment is compared to the model, and the mixture is partitioned using binary time-frequency masking by labeling bins similar to the model as the repeating background. Evaluation on a dataset of 1,000 song clips showed that this method can improve on the performance of an existing music/voice separation method without requiring particular features or complex frameworks.
Keywords :
music; source separation; spectral analysis; time-frequency analysis; binary time-frequency masking; complex frameworks; labeling bins; music separation method; period boundary; popular songs; repeating musical structure extraction; repeating segment model; repetition; singer; song clips; spectrogram; time-frequency bin; varying lyrics; voice separation method; Equations; Indexes; Mathematical model; Music; Periodic structures; Spectrogram; Time frequency analysis; Binary Time-Frequency Masking; Music/Voice Separation; Repeating Pattern;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946380