You are here: indexactivitiestheme1projectsvideo

Microscope image analysis for automatic phases recognition in surgery


General purpose

The need for a better integration of the new generation of Computer-Assisted-Surgical (CAS) systems has been recently emphasized. One necessity to achieve this objective is to retrieve data from the Operating Room (OR) with different sensors, then to derive models from these data. Recently, the use of videos from cameras in the OR has demonstrated its efficiency. In this project, we propose a framework to assist in the development of systems for the automatic recognition of high level surgical tasks using microscope videos analysis. We are using machine learning based approaches along with computer vision techniques to automatically recognize surgical tasks from surgical videos (PhD of Florent Lalys). In collaboration with Carl Zeiss Medical Systems (, we are studying the use of this approach for automatic indexation of surgical videos.


The idea is to combine state-of-the-art computer vision techniques with time series analysis. The first step of the framework consisted in the definition of several visual cues for extracting semantic information, therefore characterizing each frame of the video. Five different pieces of image-based classifiers were therefore implemented. Specific pre-processing steps may also be applied for dedicated visual cue detection. Time series classification algorithms were then applied to model time-varying data. Dynamic Time Warping (DTW) and Hidden Markov Models (HMM) were tested. This association combined the advantages of all methods for better understanding of the problem.

Then, our framework was validated on two various datasets, including one specific types of neurosurgical interventions and one type of ophthalmological surgery. The following figures show the different surgical phases that have been defined by surgeons.

Definition of 6 surgical phases in Hypophyse surgeries :
Definition of 8 surgical phases in Cataract surgeries : 8_etape_cataracte.jpg

With this system, we are now able to recognize the surgical phase of every unknown image by computing its signature and then classifying it with machine learning techniques.This will enable the system to be used in clinical applications such as post-operative video indexation.



Here is a demonstration of the system, in the context of cataract surgeries :