Title :
Development of speechreading supplements based on automatic speech recognition
Author :
Duchnowski, Paul ; Lum, David S. ; Krause, Jean C. ; Sexton, Matthew G. ; Bratakos, Maroula S. ; Braida, Louis D.
Author_Institution :
Res. Lab. of Electron., MIT, Cambridge, MA, USA
fDate :
4/1/2000
Abstract :
In manual-cued speech (MCS), a speaker produces hand gestures to resolve ambiguities among speech elements that are often confused by speechreaders. The shape of the hand distinguishes among consonants; the position of the hand relative to the face distinguishes among vowels. Experienced receivers of MCS achieve nearly perfect reception of everyday connected speech. MCS has been taught to very young deaf children and greatly facilitates language learning, communication, and general education. This manuscript describes a system that can produce a form of cued speech automatically in real time and reports on its evaluation by trained receivers of MCS. Cues are derived by a hidden Markov model (HMM)-based, speaker-dependent phonetic speech recognizer that uses context-dependent phone models and are presented visually by superimposing animated handshapes on the face of the talker. The benefit provided by these cues depends strongly on the articulation of hand movements and on precise synchronization of the actions of the hands and the face. Using the system reported here, experienced cue receivers can correctly recognize roughly two-thirds of the keywords in cued low-context sentences, compared with roughly one-third by speechreading alone (SA). The practical significance of these improvements is that they support fairly normal rates of reception of conversational speech, a task that is often difficult via SA.
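To illustrate the kind of post-processing the abstract describes, the sketch below shows how a stream of recognized phone labels could be grouped into cued-speech cue units, each a handshape (consonant group) displayed at a hand position (vowel group). This is a minimal illustrative sketch, not the authors' implementation: the phone-to-handshape and phone-to-position tables, the neutral defaults, and the function name phones_to_cues are simplified assumptions, and the real system also handles recognizer latency, cue timing, and synchronization with the talker's face.

```python
# Minimal illustrative sketch (assumed, not the paper's actual cue assignments):
# map recognized phone labels to cued-speech cues, i.e. (handshape, position) pairs.

# Hypothetical consonant -> handshape and vowel -> position tables (partial).
HANDSHAPE = {"p": 1, "d": 1, "k": 2, "v": 2, "s": 3, "r": 3, "b": 4, "n": 4, "m": 5, "t": 5}
POSITION = {"ae": "side", "ah": "side", "iy": "mouth", "er": "mouth", "aa": "chin", "uw": "throat"}

NEUTRAL_SHAPE = 5          # assumed handshape for a vowel with no preceding consonant
NEUTRAL_POSITION = "side"  # assumed position for a consonant with no following vowel


def phones_to_cues(phones):
    """Group a phone sequence into CV cue units: (handshape, position) pairs."""
    cues = []
    i = 0
    while i < len(phones):
        p = phones[i]
        if p in HANDSHAPE:
            # Consonant: pair it with the following vowel if one comes next.
            if i + 1 < len(phones) and phones[i + 1] in POSITION:
                cues.append((HANDSHAPE[p], POSITION[phones[i + 1]]))
                i += 2
            else:
                cues.append((HANDSHAPE[p], NEUTRAL_POSITION))
                i += 1
        else:
            # Vowel with no preceding consonant: neutral handshape at the vowel's position.
            cues.append((NEUTRAL_SHAPE, POSITION.get(p, NEUTRAL_POSITION)))
            i += 1
    return cues


if __name__ == "__main__":
    # e.g. recognizer output for "pat": /p ae t/ -> [(1, 'side'), (5, 'side')]
    print(phones_to_cues(["p", "ae", "t"]))
```

Each resulting (handshape, position) pair would drive the animated hand overlay on the video of the talker; in the paper's system this rendering is synchronized carefully with the facial actions, which the abstract identifies as critical to the benefit obtained.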
Keywords :
handicapped aids; hidden Markov models; speech recognition; ambiguities resolution; autocuer; automatic speech recognition; consonants; context-dependent phone models; cued speech; everyday connected speech; general education; hand movements articulation; hand shape; language learning; manual-cued speech; speaker-dependent phonetic speech recognizer; speechreading supplements development; transliteration; very young deaf children; Automatic speech recognition; Context modeling; Deafness; Face recognition; Facial animation; Hidden Markov models; Real time systems; Shape; Speech analysis; Speech recognition;
Journal_Title :
IEEE Transactions on Biomedical Engineering