DocumentCode
463447
Title
Generative Model of Voice in Noise for Structured Coding Applications
Author
Jinachitra, P. ; Smith, Jeffrey O.
Author_Institution
Center for Comput. Res. in Music & Acoust., Stanford Univ., CA, USA
Volume
1
fYear
2007
fDate
15-20 April 2007
Abstract
A generative model of a human voice is presented, based on many pseudo-physical considerations. For robustness, observation noise is also included in the model. An EM-algorithm framework for inference and learning is then described. An instance of approximate inference and subsequent learning presented allows an extraction of voice parameter which can be used for structured coding application. This set of parameters allows a great amount of compression as well as the flexibility in making modification to pitch, duration and breathiness, noise-free synthesis compared to other non-parametric approaches.
Keywords
expectation-maximisation algorithm; speech coding; speech enhancement; EM-algorithm; generative model; human voice; noise-free synthesis; observation noise; structured coding; subsequent learning; Acoustic noise; Application software; Audio coding; Filters; Gaussian noise; Human voice; Noise generators; Noise robustness; Speech enhancement; Speech synthesis; Structured coding; generative model of voice; parametric voice modeling; speech enhancement;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366671
Filename
4217071
Link To Document