DocumentCode
3328454
Title
Speech synthesis from real time ultrasound images of the tongue
Author
Denby, Bruce ; Stone, Maureen
Author_Institution
Lab. des Instruments et Systemes, Univ. Pierre et Marie Curie, Paris, France
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
A machine learning technique is used to match reconstructed tongue contours in 30-frame-per-second ultrasound images to speaker vocal tract parameters obtained from a synchronized audio track. Speech synthesized using the learned parameters, with noise as an activation function, displays many of the time- and frequency-domain characteristics of the original audio and, for isolated passages, is remarkably clear, although no articulators other than the tongue are included.
Keywords
biomedical ultrasonics; image reconstruction; image sequences; learning (artificial intelligence); medical image processing; speech synthesis; audio track; machine learning technique; medical ultrasound; real time ultrasound images; tongue contours reconstruction; vocal tract parameters; Biomedical imaging; Data mining; Data visualization; GSM; Instruments; Speech codecs; Speech enhancement; Speech synthesis; Tongue; Ultrasonic imaging
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326078
Filename
1326078