• DocumentCode
    294760
  • Title

    Speech recognition for image animation and coding

  • Author

    Chou, Wu ; Chen, Homer H.

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    4
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    2253
  • Abstract
    We discuss some issues related to acoustic-assisted image coding and animation. A talker-independent scheme for acoustic-assisted image coding and animation is studied. A perceptually based sliding window encoder is proposed. It uses the high-rate (or oversampled) viseme sequence from the audio domain for image-domain viseme interpolation and smoothing. The image-domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate for the mismatch between auditory and visual perception.
  • Keywords
    acoustic signal processing; computer animation; hearing; image coding; interpolation; signal sampling; smoothing methods; speech recognition; visual perception; acoustic assisted image coding; audio domain; auditory perception; high rate viseme sequence; image animation; image domain viseme interpolation; image domain viseme smoothing; look-ahead moving interpolation; look-back moving interpolation; oversampled viseme sequence; sliding window encoder; talker independent animation; talker independent image coding; Animation; Bit rate; Decoding; Humans; Image sequences; Mouth; Shape
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479939
  • Filename
    479939