DocumentCode
3425373
Title
Development of a visual speech synthesizer via second-order isomorphism
Author
Jiang, Jintao ; Aronoff, Justin M. ; Bernstein, Lynne E.
Author_Institution
Dept. of Commun., Neurosci. House Ear Inst., Los Angeles, CA
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4677
Lastpage
4680
Abstract
The goals of this study were to evaluate the synthesis of visible speech that was based on 3-D motion data using second-order isomorphism. To do this, word stimuli were generated for perceptual discrimination and identification tasks. Discrimination trials were based on word-pairs that were predicted to be at four levels of perceptual dissimilarity. Results from the discrimination tasks indicated that visual synthetic speech perception maintained the dissimilarity structure of visual natural speech perception. This study demonstrated that the relatively sparse 3-D representations of face motion could be used to synthesize visual speech that perceptually approximate visual natural speech, suggesting that synthesizer development and psychophysics can benefit mutually when the goals are aligned.
Keywords
speech processing; speech synthesis; 3D motion data; face motion; perceptual discrimination; perceptual dissimilarity; second-order isomorphism; visual natural speech perception; visual speech synthesizer; visual synthetic speech perception; word-pairs; Acoustic devices; Deafness; Humans; Natural languages; Optical feedback; Optical sensors; Signal synthesis; Speech analysis; Speech synthesis; Synthesizers; Visual speech synthesis; dissimilarity; second-order isomorphism; visual speech perception;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518700
Filename
4518700
Link To Document