مرکز منطقه ای اطلاع رساني علوم و فناوري - REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation

DocumentCode :

32800

Title :

REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation

Author :

Rafii, Zafar ; Pardo, Bryan

Author_Institution :

Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA

Volume :

Issue :

fYear :

2013

fDate :

Jan. 2013

Firstpage :

Lastpage :

Abstract :

Repetition is a core principle in music. Many musical pieces are characterized by an underlying repeating structure over which varying elements are superimposed. This is especially true for pop songs where a singer often overlays varying vocals on a repeating accompaniment. On this basis, we present the REpeating Pattern Extraction Technique (REPET), a novel and simple approach for separating the repeating “background” from the non-repeating “foreground” in a mixture. The basic idea is to identify the periodically repeating segments in the audio, compare them to a repeating segment model derived from them, and extract the repeating patterns via time-frequency masking. Experiments on data sets of 1,000 song clips and 14 full-track real-world songs showed that this method can be successfully applied for music/voice separation, competing with two recent state-of-the-art approaches. Further experiments showed that REPET can also be used as a preprocessor to pitch detection algorithms to improve melody extraction.

Keywords :

music; speech processing; time-frequency analysis; REPET; full-track real-world songs; melody extraction improvement; music-voice separation; nonrepeating foreground; pitch detection algorithms; preprocessor; repeating background; repeating pattern extraction technique; time-frequency masking; Adaptation models; Estimation; Hidden Markov models; Music; Spectrogram; Speech; Speech processing; Melody extraction; music structure analysis; music/voice separation; repeating patterns;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2012.2213249

Filename :

6269059

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=32800