• DocumentCode
    3328454
  • Title

    Speech synthesis from real time ultrasound images of the tongue

  • Author

    Denby, Bruce ; Stone, Maureen

  • Author_Institution
    Lab. des Instruments et Systemes, Univ. Pierre et Marie Curie, Paris, France
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    A machine learning technique is used to match reconstructed tongue contours in 30-frame-per-second ultrasound images to speaker vocal tract parameters obtained from a synchronized audio track. Speech synthesized using the learned parameters and noise as an activation function displays many of the time- and frequency-domain characteristics of the original audio and, for isolated passages, is remarkably clear, although no articulators other than the tongue are included.
  • Keywords
    biomedical ultrasonics; image reconstruction; image sequences; learning (artificial intelligence); medical image processing; speech synthesis; audio track; machine learning technique; medical ultrasound; real time ultrasound images; speech synthesis; tongue contours reconstruction; vocal tract parameters; Biomedical imaging; Data mining; Data visualization; GSM; Instruments; Speech codecs; Speech enhancement; Speech synthesis; Tongue; Ultrasonic imaging;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326078
  • Filename
    1326078