• DocumentCode
    251028
  • Title

    Surgical tool attributes from monocular video

  • Author

    Kumar, Sudhakar ; Narayanan, Madusudanan Sathia ; Singhal, Purnima ; Corso, Jason J. ; Krovi, Venkat

  • Author_Institution
    Dept. of Mech. Eng., Univ. at Buffalo, Buffalo, NY, USA
  • fYear
    2014
  • fDate
    May 31 2014-June 7 2014
  • Firstpage
    4887
  • Lastpage
    4892
  • Abstract
    HD Video from the (monocular or binocular) endoscopic camera provides a rich real-time sensing channel from surgical site to the surgeon console in various Minimally Invasive Surgery (MIS) procedures. However, a real-time framework for video understanding would be critical for tapping into the rich information-content provided by the non-invasive and well-established digital endoscopic video-streaming modality. While contemporary research focuses on enhancing aspects such as tool-tracking within the challenging visual scenes, we consider the associated problem of using that rich (but often compromised) streaming visual data to discover the underlying semantic attributes of the tools. Directly analyzing the surgical videos to extract more realistic attributes online can aid in the decision-making and feedback aspects. We propose a novel probabilistic attribute labelling framework with Bayesian filtering to identify associated semantics (open/closed, stained with blood etc.) to ultimately give semantic feedback to the surgeon. Our robust video-understanding framework overcomes many of the challenges (tissue deformations, image specularities, clutter, tool-occlusion due to blood and/or organs) under realistic in-vivo surgical conditions. Specifically, this manuscript performs rigorous experimental analysis of the resulting method with varying parameters and different visual features on a data-corpus consisting of real surgical procedures performed on patients with da Vinci Surgical System [9].
  • Keywords
    Bayes methods; endoscopes; medical image processing; surgery; video streaming; Bayesian filtering; HD video; binocular endoscopic camera; da Vinci surgical system; digital endoscopic video-streaming modality; image specularity; minimally invasive surgery; monocular video; probabilistic attribute labelling framework; real-time sensing channel; semantic attribute; surgical tool attribute; tissue deformation; video understanding; Accuracy; Feature extraction; Semantics; Streaming media; Support vector machines; Surgery; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2014 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Type

    conf

  • DOI
    10.1109/ICRA.2014.6907575
  • Filename
    6907575