مرکز منطقه ای اطلاع رساني علوم و فناوري - Singing-voice separation from monaural recordings using robust principal component analysis

DocumentCode :

3142883

Title :

Singing-voice separation from monaural recordings using robust principal component analysis

Author :

Huang, Po-Sen ; Chen, Scott Deeann ; Smaragdis, Paris ; Hasegawa-Johnson, Mark

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

Lastpage :

Abstract :

Separating singing voices from music accompaniment is an important task in many applications, such as music information retrieval, lyric recognition and alignment. Music accompaniment can be assumed to be in a low-rank subspace, because of its repetition structure; on the other hand, singing voices can be regarded as relatively sparse within songs. In this paper, based on this assumption, we propose using robust principal component analysis for singing-voice separation from music accompaniment. Moreover, we examine the separation result by using a binary time-frequency masking method. Evaluations on the MIR-1K dataset show that this method can achieve around 1~1.4 dB higher GNSDR compared with two state-of-the-art approaches without using prior training or requiring particular features.

Keywords :

audio recording; music; principal component analysis; source separation; speech intelligibility; time-frequency analysis; GNSDR; MIR-1K dataset; binary time-frequency masking method; low-rank subspace; lyric alignment; lyric recognition; monaural recordings; music accompaniment; music information retrieval; repetition structure; robust principal component analysis; singing-voice separation; state-of-the-art approaches; Multiple signal classification; Music; Principal component analysis; Robustness; Sparse matrices; Speech; Time frequency analysis; Music/Voice Separation; Robust Principal Component Analysis; Time-Frequency Masking;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6287816

Filename :

6287816

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3142883