DocumentCode
251028
Title
Surgical tool attributes from monocular video
Author
Kumar, Sudhakar ; Narayanan, Madusudanan Sathia ; Singhal, Purnima ; Corso, Jason J. ; Krovi, Venkat
Author_Institution
Dept. of Mech. Eng., Univ. at Buffalo, Buffalo, NY, USA
fYear
2014
fDate
May 31 2014-June 7 2014
Firstpage
4887
Lastpage
4892
Abstract
HD Video from the (monocular or binocular) endoscopic camera provides a rich real-time sensing channel from surgical site to the surgeon console in various Minimally Invasive Surgery (MIS) procedures. However, a real-time framework for video understanding would be critical for tapping into the rich information-content provided by the non-invasive and well-established digital endoscopic video-streaming modality. While contemporary research focuses on enhancing aspects such as tool-tracking within the challenging visual scenes, we consider the associated problem of using that rich (but often compromised) streaming visual data to discover the underlying semantic attributes of the tools. Directly analyzing the surgical videos to extract more realistic attributes online can aid in the decision-making and feedback aspects. We propose a novel probabilistic attribute labelling framework with Bayesian filtering to identify associated semantics (open/closed, stained with blood etc.) to ultimately give semantic feedback to the surgeon. Our robust video-understanding framework overcomes many of the challenges (tissue deformations, image specularities, clutter, tool-occlusion due to blood and/or organs) under realistic in-vivo surgical conditions. Specifically, this manuscript performs rigorous experimental analysis of the resulting method with varying parameters and different visual features on a data-corpus consisting of real surgical procedures performed on patients with da Vinci Surgical System [9].
Keywords
Bayes methods; endoscopes; medical image processing; surgery; video streaming; Bayesian filtering; HD video; binocular endoscopic camera; da Vinci surgical system; digital endoscopic video-streaming modality; image specularity; minimally invasive surgery; monocular video; probabilistic attribute labelling framework; real-time sensing channel; semantic attribute; surgical tool attribute; tissue deformation; video understanding; Accuracy; Feature extraction; Semantics; Streaming media; Support vector machines; Surgery; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Automation (ICRA), 2014 IEEE International Conference on
Conference_Location
Hong Kong
Type
conf
DOI
10.1109/ICRA.2014.6907575
Filename
6907575
Link To Document