DocumentCode :
30079
Title :
A Voice-Input Voice-Output Communication Aid for People With Severe Speech Impairment
Author :
Hawley, M.S. ; Cunningham, Sean P. ; Green, P.D. ; Enderby, P. ; Palmer, R. ; Sehgal, S. ; O´Neill, Peggy
Author_Institution :
Sch. of Health & Related Res., Univ. of Sheffield, Sheffield, UK
Volume :
21
Issue :
1
fYear :
2013
fDate :
Jan. 2013
Firstpage :
23
Lastpage :
31
Abstract :
A new form of augmentative and alternative communication (AAC) device for people with severe speech impairment-the voice-input voice-output communication aid (VIVOCA)-is described. The VIVOCA recognizes the disordered speech of the user and builds messages, which are converted into synthetic speech. System development was carried out employing user-centered design and development methods, which identified and refined key requirements for the device. A novel methodology for building small vocabulary, speaker-dependent automatic speech recognizers with reduced amounts of training data, was applied. Experiments showed that this method is successful in generating good recognition performance (mean accuracy 96%) on highly disordered speech, even when recognition perplexity is increased. The selected message-building technique traded off various factors including speed of message construction and range of available message outputs. The VIVOCA was evaluated in a field trial by individuals with moderate to severe dysarthria and confirmed that they can make use of the device to produce intelligible speech output from disordered speech input. The trial highlighted some issues which limit the performance and usability of the device when applied in real usage situations, with mean recognition accuracy of 67% in these circumstances. These limitations will be addressed in future work.
Keywords :
biomedical communication; biomedical equipment; diseases; handicapped aids; speech; speech processing; speech recognition; speech recognition equipment; speech synthesis; user interfaces; voice equipment; alternative communication device; augmentative communication device; disordered speech input; disordered speech recognition; highly disordered speech; intelligible speech output; mean recognition accuracy; message construction; real usage situations; recognition perplexity; refined key requirements; selected message-building technique; severe dysarthria; severe speech impairment; speaker-dependent automatic speech recognizers; synthetic speech; user-centered design; voice-input voice-output communication aid; Accuracy; Buildings; Hidden Markov models; Microphones; Speech; Speech recognition; Vocabulary; Augmentative and alternative communication; automatic speech recognition; dysarthria; voice output communication aid; Adult; Aged; Aged, 80 and over; Artificial Intelligence; Communication Aids for Disabled; Equipment Design; Equipment Failure Analysis; Female; Humans; Male; Middle Aged; Sound Spectrography; Speech Disorders; Speech Production Measurement; Speech Recognition Software; Treatment Outcome; Vocabulary, Controlled;
fLanguage :
English
Journal_Title :
Neural Systems and Rehabilitation Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1534-4320
Type :
jour
DOI :
10.1109/TNSRE.2012.2209678
Filename :
6259889
Link To Document :
بازگشت