DocumentCode
294760
Title
Speech recognition for image animation and coding
Author
Chou, Wu ; Chen, Homer H.
Author_Institution
AT&T Bell Labs., Murray Hill, NJ, USA
Volume
4
fYear
1995
fDate
9-12 May 1995
Firstpage
2253
Abstract
We discuss some issues related to acoustic assisted image coding and animation. An approach of talker independent acoustic assisted image coding and animation scheme is studied. A perceptually based sliding window encoder is proposed. It utilizes the high rate (or oversampled) viseme sequence from the audio domain for image domain viseme interpolation and smoothing. The image domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate the mismatch between auditory and visual perceptions
Keywords
acoustic signal processing; computer animation; hearing; image coding; interpolation; signal sampling; smoothing methods; speech recognition; visual perception; acoustic assisted image coding; audio domain; auditory perception; high rate viseme sequence; image animation; image coding; image domain viseme interpolation; image domain viseme smoothing; look-ahead moving interpolation; look-back moving interpolation; oversampled viseme sequence; sliding window encoder; speech recognition; talker independent animation; talker independent image coding; visual perception; Animation; Bit rate; Decoding; Humans; Image coding; Image sequences; Interpolation; Mouth; Shape; Smoothing methods; Speech recognition; Visual perception;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479939
Filename
479939
Link To Document