Less is More? Revisiting the Importance of Frame Rate in Real-Time Zero-Shot Surgical Video Segmentation
Ozbulak, Utku, Mousavi, Seyed Amir, Tozzi, Francesca, Rashidian, Nikdokht, Willaert, Wouter, De Neve, Wesley, Vankerschaver, Joris
–arXiv.org Artificial Intelligence
Real-time video segmentation is a promising feature for AI-assisted surgery, providing intraoperative guidance by identifying surgical tools and anatomical structures. However, deploying state-of-the-art segmentation models, such as SAM2, in real-time settings is computationally demanding, which makes it essential to balance frame rate and segmentation performance. In this study, we investigate the impact of frame rate on zero-shot surgical video segmentation, evaluating SAM2's effectiveness across multiple frame sampling rates for cholecystectomy procedures. Surprisingly, our findings indicate that in conventional evaluation settings, frame rates as low as a single frame per second can outperform 25 FPS, as fewer frames smooth out segmentation inconsistencies. However, when assessed in a real-time streaming scenario, higher frame rates yield superior temporal coherence and stability, particularly for dynamic objects such as surgical graspers. Finally, we investigate human perception of real-time surgical video segmentation among professionals who work closely with such data and find that respondents consistently prefer high FPS segmentation mask overlays, reinforcing the importance of real-time evaluation in AI-assisted surgery.
arXiv.org Artificial Intelligence
Feb-28-2025
- Country:
- Asia > South Korea
- Europe
- Belgium > Flanders
- East Flanders > Ghent (0.05)
- France > Grand Est
- Bas-Rhin > Strasbourg (0.04)
- Belgium > Flanders
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine
- Health Care Technology (1.00)
- Surgery (1.00)
- Health & Medicine
- Technology: