DocumentCode :
2588102
Title :
A improved speech synthesis system utilizing BPSO-based lip feature selection
Author :
Wang, Mengjun ; Wang, Xiangling ; Li, Gang
Author_Institution :
Sch. of Inf. Eng., HeBei Univ. of Technol., Tianjin, China
Volume :
3
fYear :
2011
fDate :
15-17 Oct. 2011
Firstpage :
1292
Lastpage :
1295
Abstract :
To get a higher lipreading recognition result in speech synthesis system driven by visual speech, Binary Particle Swarm Optimization (BPSO) algorithms is used to select the “optimal” lip feature subset. Experiments are carried out based on HMM with 4 states and 16 Gaussian mixture components in a small database for speaker-dependent case. Experiment results show that the integrated discriminate vector after feature selection obtained the information from the geometrical features and the pixel based features. Comparing with feature fusion based on concatenating, the recognition rates with feature selection based on BPSO are improved by as much as 2.42%.
Keywords :
Gaussian processes; biomedical optical imaging; feature extraction; hidden Markov models; image recognition; medical image processing; particle swarm optimisation; speech; speech recognition; Gaussian mixture component; HMM; binary particle swarm optimization algorithm; hidden Markov models; integrated discriminate vector; lip feature selection; lipreading recognition; pixel based feature; recognition rates; speaker dependent case; speech synthesis system; visual speech; Discrete cosine transforms; Feature extraction; Hidden Markov models; Image sequences; Speech; Vectors; Visualization; Binary Particle Swarm Optimization; Hidden Markov Model; feature Selection; normalized DCT coefficients; normalized geometrical feature;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Biomedical Engineering and Informatics (BMEI), 2011 4th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9351-7
Type :
conf
DOI :
10.1109/BMEI.2011.6098551
Filename :
6098551
Link To Document :
بازگشت