DocumentCode :
21655
Title :
Real-Time Task Recognition in Cataract Surgery Videos Using Adaptive Spatiotemporal Polynomials
Author :
Quellec, Gwenole ; Lamard, Mathieu ; Cochener, Beatrice ; Cazuguel, Guy
Author_Institution :
SFR ScInBioS, INSERM, Brest, France
Volume :
34
Issue :
4
fYear :
2015
fDate :
Apr-15
Firstpage :
877
Lastpage :
887
Abstract :
This paper introduces a new algorithm for recognizing surgical tasks in real-time in a video stream. The goal is to communicate information to the surgeon in due time during a video-monitored surgery. The proposed algorithm is applied to cataract surgery, which is the most common eye surgery. To compensate for eye motion and zoom level variations, cataract surgery videos are first normalized. Then, the motion content of short video subsequences is characterized with spatiotemporal polynomials: a multiscale motion characterization based on adaptive spatiotemporal polynomials is presented. The proposed solution is particularly suited to characterize deformable moving objects with fuzzy borders, which are typically found in surgical videos. Given a target surgical task, the system is trained to identify which spatiotemporal polynomials are usually extracted from videos when and only when this task is being performed. These key spatiotemporal polynomials are then searched in new videos to recognize the target surgical task. For improved performances, the system jointly adapts the spatiotemporal polynomial basis and identifies the key spatiotemporal polynomials using the multiple-instance learning paradigm. The proposed system runs in real-time and outperforms the previous solution from our group, both for surgical task recognition ( Az = 0.851 on average, as opposed to Az = 0.794 previously) and for the joint segmentation and recognition of surgical tasks ( Az = 0.856 on average, as opposed to Az = 0.832 previously).
Keywords :
biomedical optical imaging; eye; polynomials; surgery; video streaming; adaptive spatiotemporal polynomial; cataract surgery video; eye motion variation; eye surgery; eye zoom level variation; joint segmentation; motion characterization; multiple-instance learning paradigm; real-time task recognition; short video subsequence motion content; surgical task recognition; video stream; video-monitored surgery; Motion segmentation; Polynomials; Real-time systems; Spatiotemporal phenomena; Surgery; Vectors; Videos; Cataract surgery; multiple-instance learning; real-time task recognition; spatiotemporal polynomials;
fLanguage :
English
Journal_Title :
Medical Imaging, IEEE Transactions on
Publisher :
ieee
ISSN :
0278-0062
Type :
jour
DOI :
10.1109/TMI.2014.2366726
Filename :
6942202
Link To Document :
بازگشت