DocumentCode :
2429131
Title :
ROI processing for visual features extraction in lip-reading
Author :
Wang, Xiaoping ; Hao, Yufeng ; Fu, Degang ; Yuan, Chunwei
Author_Institution :
State Key Lab. of Bioelectron., Southeast Univ., Nanjing
fYear :
2008
fDate :
7-11 June 2008
Firstpage :
178
Lastpage :
181
Abstract :
Region of interest (ROI) is the key basis of visual features extraction in lip-reading process. In this paper, we discussed the ROI processing method and explored its impact on recognition accuracy with the comparison of four kinds of processed ROIs obtained by using four basic image processing methods: gray-scale normalization, difference enhancement, edge enhancement and image segmentation. Then recognition tasks for speaker-independent lip-reading were carried out by the aid of continuous hidden Markov model (CHMM). The experimental results show that for discrete cosine transform (DCT) based features, normalized gray-scale image can achieve the best recognition performance among these four ROIs.
Keywords :
discrete cosine transforms; edge detection; feature extraction; hidden Markov models; image segmentation; continuous hidden Markov model; difference enhancement; discrete cosine transform; edge enhancement; gray-scale normalization; image processing; image segmentation; region of interest; speaker-independent lip-reading; visual features extraction; Active shape model; Automatic speech recognition; Chromium; Discrete cosine transforms; Face detection; Feature extraction; Hidden Markov models; Image processing; Image recognition; Skin; Lip-reading; ROI; Visual Features Extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks and Signal Processing, 2008 International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-2310-1
Electronic_ISBN :
978-1-4244-2311-8
Type :
conf
DOI :
10.1109/ICNNSP.2008.4590335
Filename :
4590335
Link To Document :
بازگشت