Title :
Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services
Author :
Xie, Lei ; Wang, Yi ; Liu, Zhi-Qiang
Author_Institution :
City Univ. of Hong Kong, Kowloon
Abstract :
This paper presents a very low bit rate speech-to-video synthesizer, named lip assistant, that helps hearing impaired people better access multimedia services via lipreading. Lip assistant automatically converts acoustic speech to lip parameters at a bit rate of 2.2 kbps and decodes them to video-realistic mouth animation on the fly. We use multi-stream HMMs (MSHMMs) and principal component analysis (PCA) to model the audio-visual speech and the visual articulations, which are learned from audio-visual (AV) facial recordings. Speech is converted to lip parameters with natural dynamics by an expectation maximization (EM)-based audio-to-lip converter. The video synthesizer generates video-realistic mouth animations from the encoded lip parameters via PCA expansion. Finally, the mouth animation is superimposed on the original video to help hearing impaired viewers better understand the audio-visual content. Experimental results show that lip assistant can significantly improve the speech intelligibility of both machines and humans.
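The "PCA expansion" step mentioned in the abstract can be illustrated with a minimal sketch: mouth images are projected onto a few principal components to obtain compact lip parameters, and an image is rebuilt from those parameters by adding the weighted components back to the mean. Everything below is an assumption for illustration only and is not taken from the paper: the function names (fit_pca, encode, decode), the synthetic 32x32 "frames", and the choice of 5 components are all hypothetical.

```python
import numpy as np

def fit_pca(frames, n_components):
    """Learn a PCA basis from flattened frames of shape (n_frames, n_pixels)."""
    mean = frames.mean(axis=0)
    centered = frames - mean
    # Rows of vt are orthonormal principal directions, ordered by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:n_components]

def encode(frame, mean, basis):
    """Project one frame onto the basis to get its low-dimensional parameters."""
    return basis @ (frame - mean)

def decode(params, mean, basis):
    """PCA expansion: rebuild a frame from its parameters."""
    return mean + basis.T @ params

# Synthetic stand-in data (assumption): 200 frames of 32*32 pixels that lie
# close to a 5-dimensional subspace, plus a little noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 5))
mixing = rng.normal(size=(5, 1024))
frames = latent @ mixing + 0.01 * rng.normal(size=(200, 1024))

mean, basis = fit_pca(frames, n_components=5)
params = encode(frames[0], mean, basis)   # 5 numbers instead of 1024 pixels
recon = decode(params, mean, basis)
rel_error = np.linalg.norm(recon - frames[0]) / np.linalg.norm(frames[0])
```

In this toy setting only the short parameter vector would need to be transmitted per frame, which is the general idea behind a low bit rate parametric representation; the actual parameter count, quantization, and frame rate used by the authors are not stated in this record.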
Keywords :
computer animation; data visualisation; expectation-maximisation algorithm; handicapped aids; hidden Markov models; multimedia computing; principal component analysis; speech intelligibility; speech synthesis; video signal processing; PCA; expectation maximization algorithm; hearing impaired people; lip assistant; multimedia service; multistream HMM; principal component analysis; speech intelligibility; speech visualization; speech-to-video synthesizer; video-realistic mouth animation; Animation; Auditory system; Bit rate; Decoding; Hidden Markov models; Mouth; Principal component analysis; Speech; Synthesizers; Visualization;
Conference_Title :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
Electronic_ISBN :
1-4244-0100-3
DOI :
10.1109/ICSMC.2006.384815