Identifying Surgical Instruments in Laparoscopy Using Deep Learning Instance Segmentation

August 29, 2025 · Declared Dead · 🏛 International Conference on Content-Based Multimedia Indexing

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Sabrina Kletz, Klaus Schoeffmann, Jenny Benois-Pineau, Heinrich Husslein
arXiv ID: 2508.21399
Category: cs.CV (Computer Vision)
Cross-listed: cs.MM
Citations: 39
Venue: International Conference on Content-Based Multimedia Indexing
Last Checked: 4 months ago

Abstract

Recorded videos from surgeries have become an increasingly important information source for the field of medical endoscopy, since the recorded footage shows every single detail of the surgery. However, while video recording is straightforward these days, automatic content indexing, the basis for content-based search in a medical video archive, is still a great challenge due to the very special video content. In this work, we investigate segmentation and recognition of surgical instruments in videos recorded from laparoscopic gynecology. More precisely, we evaluate the achievable performance of segmenting surgical instruments from their background by using a region-based fully convolutional network for instance-aware (1) instrument segmentation as well as (2) instrument recognition. While the first part addresses only binary segmentation of instances (i.e., distinguishing between instrument and background), we also investigate multi-class instrument recognition (i.e., identifying the type of instrument). Our evaluation results show that even with a moderately low number of training examples, we are able to localize and segment instrument regions with fairly high accuracy. However, the results also reveal that determining the particular instrument is still very challenging, due to the inherently high similarity of surgical instruments.
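
The difference between the two tasks in the abstract, binary segmentation versus multi-class recognition, can be illustrated with a small sketch. This is not the authors' evaluation protocol: it simplifies instance-aware masks to per-pixel class maps, and the helper names (`mask_iou`, `to_binary`), the 4x4 arrays, and the class IDs are all hypothetical. It only shows how a prediction can localize an instrument perfectly while still assigning the wrong instrument type.

```python
import numpy as np


def mask_iou(pred: np.ndarray, target: np.ndarray) -> float:
    """Intersection over union of two masks of equal shape."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    union = np.logical_or(pred, target).sum()
    if union == 0:
        # Both masks are empty: treat this as perfect agreement (a convention).
        return 1.0
    return float(np.logical_and(pred, target).sum() / union)


def to_binary(class_map: np.ndarray) -> np.ndarray:
    """Collapse a per-pixel class map (0 = background) to instrument vs. background."""
    return class_map > 0


# Hypothetical class maps: 0 = background, 1 and 2 = two made-up instrument types.
truth = np.array([
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
])
pred = np.array([
    [0, 0, 0, 0],
    [0, 2, 2, 0],
    [0, 2, 2, 0],
    [0, 0, 0, 0],
])

# Task (1): binary segmentation ignores the instrument type.
binary_iou = mask_iou(to_binary(pred), to_binary(truth))

# Task (2): recognition scores each instrument type separately.
class1_iou = mask_iou(pred == 1, truth == 1)

print(f"binary IoU: {binary_iou:.2f}, class-1 IoU: {class1_iou:.2f}")
```

In this toy case the instrument region is outlined exactly, so the binary score is perfect, but the predicted type is wrong, so the per-class score for type 1 drops to zero. This mirrors, in miniature, the gap between localization and recognition that the abstract reports.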