Real-time Capable Learning-based Visual Tool Pose Correction via Differentiable Simulation
–arXiv.org Artificial Intelligence
--Autonomy in Minimally Invasive Robotic Surgery (MIRS) has the potential to reduce surgeon cognitive and task load, thereby increasing procedural efficiency. However, implementing accurate autonomous control can be difficult due to poor end-effector proprioception, a limitation of their cable-driven mechanisms. Although the robot may have joint encoders for the end-effector pose calculation, various non-idealities make the entire kinematics chain inaccurate. Modern vision-based pose estimation methods lack real-time capability or can be hard to train and generalize. In this work, we demonstrate a real-time capable, vision transformer-based pose estimation approach that is trained using end-to-end differentiable kinematics and rendering in simulation. We demonstrate the potential of this method to correct for noisy pose estimates in simulation, with the longer term goal of verifying the sim-to-real transferability of our approach. The da Vinci Surgical System has been widely applied into different kinds of MIRS procedures in specializations such as, urologic [1], gynecologic [2], and cardiothoracic [3] surgery.
arXiv.org Artificial Intelligence
May-15-2025
- Country:
- North America > United States > Ohio > Cuyahoga County > Cleveland (0.04)
- Genre:
- Research Report (0.64)
- Industry:
- Health & Medicine
- Health Care Technology (1.00)
- Surgery (1.00)
- Health & Medicine
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (0.88)
- Robots (1.00)
- Vision (1.00)
- Machine Learning > Neural Networks
- Information Technology > Artificial Intelligence