DocumentCode :
3527741
Title :
Data-driven voice soruce waveform modelling
Author :
Thomas, Mark R P ; Gudnason, Jon ; Naylor, Patrick A.
Author_Institution :
Imperial Coll. London, London
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3965
Lastpage :
3968
Abstract :
This paper presents a data-driven approach to the modelling of voice source waveforms. The voice source is a signal that is estimated by inverse-filtering speech signals with an estimate of the vocal tract filter. It is used in speech analysis, synthesis, recognition and coding to decompose a speech signal into its source and vocal tract filter components. Existing approaches parameterize the voice source signal with physically- or mathematically-motivated models. Though the models are well-defined, estimation of their parameters is not well understood and few are capable of reproducing the large variety of voice source waveforms. Here we present a data-driven approach to classify types of voice source waveforms based upon their mel frequency cepstrum coefficients with Gaussian mixture modelling. A set of ldquoprototyperdquo waveform classes is derived from a weighted average of voice source cycles from real data. An unknown speech signal is then decomposed into its prototype components and resynthesized. Results indicate that with sixteen voice source classes, low resynthesis errors can be achieved.
Keywords :
Gaussian processes; filtering theory; parameter estimation; speech coding; speech recognition; speech synthesis; tracking filters; Gaussian mixture modelling; data-driven voice source waveform modelling; inverse-filtering speech signals; parameter estimation; speech analysis; speech coding; speech synthesis; vocal tract filter estimation; Cepstrum; Filters; Frequency; Mathematical model; Parameter estimation; Signal synthesis; Speech analysis; Speech coding; Speech recognition; Speech synthesis; LPC; Voice source; closed-phase analysis; inverse-filtering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960496
Filename :
4960496
Link To Document :
بازگشت