Title :
Visual-speech to text conversion applicable to telephone communication for deaf individuals
Author :
Heracleous, Panikos ; Ishiguro, Hiroshi ; Hagita, Norihiro
Abstract :
The access to communication technologies has become essential for the handicapped people. This study introduces the initial step of an automatic translation system able to translate visual speech used by deaf individuals to text, or auditory speech. A such a system would enable deaf users to communicate with each other and with normal-hearing people through telephone networks or through Internet by only using telephone devices equipped with simple cameras. In particular, this paper introduces automatic recognition and conversion to text of Cued Speech for French. Cued speech is a visual mode used for communication in the deaf society. Using hand shapes placed in different positions near the face as a complement to lipreading, all the sounds of a spoken language can be visually distinguished and perceived. Experimental results show high recognition rates for both isolated word and continuous phoneme recognition experiments in Cued Speech for French.
Keywords :
Internet telephony; handicapped aids; natural language processing; speech processing; speech recognition; speech synthesis; telephone equipment; Cued speech; Internet; auditory speech; automatic recognition; automatic translation system; continuous phoneme recognition; deaf user; handicapped people; isolated word recognition rate; lip reading; normal hearing people; spoken language; telephone communication technology; telephone device; telephone network; visual-speech to text conversion; Accuracy; Hidden Markov models; Lips; Shape; Speech; Speech recognition; Visualization; French Cued Speech; HMM; Telephone communication; automatic recognition; concatenative feature fusion; deaf;
Conference_Titel :
Telecommunications (ICT), 2011 18th International Conference on
Conference_Location :
Ayia Napa
Print_ISBN :
978-1-4577-0025-5
DOI :
10.1109/CTS.2011.5898904