• DocumentCode
    312200
  • Title
    Deriving articulatory representations from speech with various excitation modes
  • Author
    Richards, Hywel B. ; Mason, John S. ; Hunt, Melvyn J. ; Bridle, John S.
  • Author_Institution
    Dept. of Electr. & Electron. Eng., Univ. of Wales, Swansea, UK
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1233
  • Abstract
    A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.
  • Keywords
    feedforward neural nets; multilayer perceptrons; speech coding; speech processing; speech synthesis; analysis-by-synthesis scheme; articulatory codebooks; articulatory representation derivation; complex dynamic model; computation; continuously-valued area estimates; excitation modes; fricative place position; multilayer perceptron; silence; speech; stop place position; vocal tract shape sequence estimation; voiced speech; voiceless speech; Background noise; Bandwidth; Character generation; Noise measurement; Parameter estimation; Resonance; Shape; Speech analysis; Speech synthesis; State estimation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type
    conf
  • DOI
    10.1109/ICSLP.1996.607831
  • Filename
    607831