LoCo: Learning 3D Location-Consistent Image Features with a Memory-Efficient Ranking Loss

Mar-27-2025, 12:08:06 GMT–Neural Information Processing Systems

Image feature extractors are rendered substantially more useful if different views of the same 3D location yield similar features while still being distinct from other locations. A feature extractor that achieves this goal even under significant viewpoint changes must recognise not just semantic categories in a scene, but also understand how different objects relate to each other in three dimensions. Existing work addresses this task by posing it as a patch retrieval problem, training the extracted features to facilitate retrieval of all image patches that project from the same 3D location. However, this approach uses a loss formulation that requires substantial memory and computation resources, limiting its applicability for largescale training. We present a method for memory-efficient learning of locationconsistent features that reformulates and approximates the smooth average precision objective.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Mar-27-2025, 12:08:06 GMT

Conferences PDF

Add feedback

Country:
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)

Genre:
- Research Report > Experimental Study (0.93)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning
      - Neural Networks (1.00)
      - Performance Analysis (0.66)
      - Statistical Learning (0.68)
    - Natural Language > Text Processing (0.66)
    - Representation & Reasoning (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found