Title :
Concurrent estimation of singing voice F0 and phonemes by using spectral envelopes estimated from polyphonic music
Author :
Fujihara, Hiromasa ; Goto, Masataka
Abstract :
The scarcity of available multi-track recordings constitutes a severe constraint on the training of probabilistic models for voice extraction from polyphonic music. We propose a novel training method to estimate a spectral envelope of a singing voice that makes it possible to train the models from a polyphonic music without segregating a singing voice. We implement this method as an extension to the existing W-PST method, which concurrently estimates singing voice fundamental frequency (F0) and phoneme from polyphonic music. The novel training method is based on random sampling from probabilistic distributions. We conducted experiments on concurrent F0 and phoneme estimation and confirm the effectiveness of our method.
Keywords :
acoustic signal processing; music; probability; W-PST method; concurrent estimation; multitrack recordings; phonemes; polyphonic music; probabilistic distributions; probabilistic models; singing voice F0; singing voice fundamental frequency; spectral envelope estimation; voice extraction; Equations; Estimation; Harmonic analysis; Mathematical model; Noise; Probabilistic logic; Training; F0 estimation; Phoneme recognition; Singing voice; Spectral envelope estimation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946416