In VG VG VG VG In DU DU CU In CU MAOu MAOu MA MA In In ppppprrStStTTSSSSGGGGssuuuuutttt33tttttTTTT3R3R3R3R33RR+++RR+++OOOuuuOurrss

Jun-22-2026, 23:33:22 GMT–Neural Information Processing Systems

The family of feed-forward reconstruction model regresses pointmap of all input images to a reference frame coordinate system, along with other auxiliary outputs, in a single forward pass. However, we find that current models struggle with fine geometry and robustness due to (i) the scarcity of high-fidelity depth and pose supervision and (ii) the inherent geometric misalignment from multi-view pointmap regression. Fin3R jointly tackles two issues with an extra lightweight fine-tuning step. We freeze the decoder, which handles view matching, and fine-tune only the image encoder--the component dedicated to feature extraction. The encoder is enriched with fine geometric details distilled from a strong monocular teacher model on large, unlabeled datasets, using a custom, lightweight LoRA adapter.

artificial intelligence, estimation, machine learning, (16 more...)

Neural Information Processing Systems

Jun-22-2026, 23:33:22 GMT

Conferences PDF

Add feedback

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found