• DocumentCode
    463447
  • Title

    Generative Model of Voice in Noise for Structured Coding Applications

  • Author

    Jinachitra, P. ; Smith, Jeffrey O.

  • Author_Institution
    Center for Comput. Res. in Music & Acoust., Stanford Univ., CA, USA
  • Volume
    1
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    A generative model of a human voice is presented, based on many pseudo-physical considerations. For robustness, observation noise is also included in the model. An EM-algorithm framework for inference and learning is then described. An instance of approximate inference and subsequent learning presented allows an extraction of voice parameter which can be used for structured coding application. This set of parameters allows a great amount of compression as well as the flexibility in making modification to pitch, duration and breathiness, noise-free synthesis compared to other non-parametric approaches.
  • Keywords
    expectation-maximisation algorithm; speech coding; speech enhancement; EM-algorithm; generative model; human voice; noise-free synthesis; observation noise; structured coding; subsequent learning; Acoustic noise; Application software; Audio coding; Filters; Gaussian noise; Human voice; Noise generators; Noise robustness; Speech enhancement; Speech synthesis; Structured coding; generative model of voice; parametric voice modeling; speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366671
  • Filename
    4217071