Towards Sharper Object Boundaries in Self-Supervised Depth Estimation

Cecille, Aurélien, Duffner, Stefan, Davoine, Franck, Agier, Rémi, Neveu, Thibault

Nov-19-2025–arXiv.org Artificial Intelligence

Monocular depth estimation is a fundamental problem in computer vision with applications in autonomous driving, robotics and augmented reality. Recently, self-supervised learning methods have achieved impressive results by using view synthesis as a supervisory signal, but despite these advances, handling depth discontinuities remains challenging. In most scenes, foreground objects occlude the background, creating depth discontinuities at object boundaries. Conventional models assign a single depth value per pixel, but edge uncertainty often causes depth values to be averaged between foreground and background depths, blurring transitions and introducing artifacts in the point cloud (see Figure 2). To address this, we propose to represent per-pixel depth as a multimodal distribution, explicitly modeling both depths at boundaries, preserving sharp transitions and removing artifacts.

artificial intelligence, inductive learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

Nov-19-2025

arXiv.org PDF

Add feedback

Country:
- Europe (0.46)

Genre:
- Research Report (0.50)

Industry:
- Automobiles & Trucks (0.34)
- Transportation > Ground
  - Road (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Vision > Image Understanding (0.66)
  - Machine Learning > Inductive Learning (0.54)
  - Representation & Reasoning > Uncertainty (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found