Title :
Speech control in surgery: A field analysis and strategies
Author :
Schuller, Björn ; Can, Salman ; Feussner, Hubertus ; Wöllmer, Martin ; Arisc, Dejan ; Hörnler, Benedikt
Author_Institution :
Inst. for Human-Machine Commun., Germany
fDate :
June 28 2009-July 3 2009
Abstract :
This work introduces a robot driven camera controlled by speech. The SIMIS database of 20 recordings of real life surgical operations serves as basis for analyses and noise modelling. To overcome low recognition performance due to high noise levels during operations, the vocabulary was chosen to be highly limited and multiple noise reduction methods have been investigated. We show that the use of feature enhancement techniques, such as Histogram Equalization or a Switching Linear Dynamic Model capturing the dynamics of speech show a remarkable improvement in recognition accuracy. Considering a severe condition of usage of the recognition system with all appearing noise types, the mean accuracy can be raised from 89.67% to 91.16% with SLDM, and to 95.50% with HEQ enhancement.
Keywords :
cameras; interference suppression; medical robotics; noise abatement; robot vision; speech enhancement; speech recognition; surgery; SIMIS database; feature enhancement techniques; histogram equalization; multiple noise reduction methods; real life surgical operations; robot driven camera; speech control; surgery; switching linear dynamic model; Cameras; Databases; Histograms; Noise level; Noise reduction; Robot control; Robot vision systems; Speech analysis; Surgery; Vocabulary; Acoustic noise; Biomedical equipment safety; Robustness; Speech enhancement; Speech recognition;
Conference_Titel :
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4244-4290-4
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2009.5202719