DocumentCode
3514546
Title
Data Fusion for Geometrical and Pixel Based Lip Feature
Author
Wang Mengjun ; Li Gang
Author_Institution
Sch. of Inf. Eng., HeBei Univ. of Technol., Tianjin, China
fYear
2010
fDate
28-29 Oct. 2010
Firstpage
301
Lastpage
304
Abstract
Lipreading is applied to synthesize speech for speech-impaired people. To achieve a higher recognition rate, feature-level data fusion with weighting coefficients is used to integrate lip information from different kinds of lip features. Experiments are carried out using HMMs with different numbers of states and Gaussian mixture components on a small database for the speaker-dependent case. The experimental results show that the integrated discriminate vector obtained after feature fusion captures information from both the geometrical feature vector of the lip region and the DCT coefficients of the lip ROI. With the best weighting coefficients, m:n = 1.5:1, the recognition rate is improved by as much as 5.02% and 8.37%, respectively.
Keywords
Gaussian processes; discrete cosine transforms; feature extraction; handicapped aids; hidden Markov models; sensor fusion; speech synthesis; DCT coefficients; Gaussian mixture component; HMM; data fusion; geometrical based lip feature; integrated discriminate vector; lipreading; pixel based lip feature; speech synthesis; speech-impaired people; Discrete cosine transforms; Feature extraction; Hidden Markov models; Image sequences; Pixel; Speech; Visualization; Hidden Markov Model; data fusion; geometrical based feature vector; pixel based feature vector; weighting combination;
fLanguage
English
Publisher
IEEE
Conference_Titel
Intelligence Information Processing and Trusted Computing (IPTC), 2010 International Symposium on
Conference_Location
Huanggang
Print_ISBN
978-1-4244-8148-4
Electronic_ISBN
978-0-7695-4196-9
Type
conf
DOI
10.1109/IPTC.2010.35
Filename
5663130