Title :
Background Music Removal Based on Cepstrum Transformation for Popular Singer Identification
Author :
Tsai, Wei-Ho ; Lin, Hao-Ping
Author_Institution :
Dept. of Electron. Eng., Nat. Taipei Univ. of Technol., Taipei, Taiwan
fDate :
7/1/2011 12:00:00 AM
Abstract :
One major challenge of identifying singers in popular music recordings lies in how to reduce the interference of background accompaniment in trying to characterize the singer voice. Although a number of studies on automatic Singer IDentification (SID) from acoustic features have been reported, most systems to date, however, do not explicitly deal with the background accompaniment. This study proposes a background accompaniment removal approach for SID by exploiting the underlying relationships between solo singing voices and their accompanied versions in cepstrum. The relationships are characterized by a transformation estimated using a large set of accompanied singing generated by manually mixing solo singing with the accompaniments extracted from Karaoke VCDs. Such a transformation reflects the cepstrum variations of a singing voice before and after it is added with accompaniments. When an unknown accompanied voice is presented to our system, the transformation is performed to convert the cepstrum of the accompanied voice into a solo-voice-like one. Our experiments show that such a background removal approach improves the SID accuracy significantly; even when a test music recording involves sung language not covered in the data for estimating the transformation.
Keywords :
audio recording; music; Karaoke VCD; SID accuracy; acoustic features; automatic singer identification; background accompaniment removal approach; background music removal; cepstrum transformation; interference reduction; popular music recordings; popular singer identification; singer voice; solo singing voices; Cepstrum; Feature extraction; Indexes; Instruments; Materials; Testing; Training; Background accompaniment; cepstrum transformation; singer identification (SID);
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2087752