Moving Off-the-Grid: Scene-Grounded Video Representations

Oct-10-2025, 19:12:42 GMT–Neural Information Processing Systems

Current vision models typically maintain a fixed correspondence between their representation structure and image space. Each layer comprises a set of tokens arranged "on-the-grid," which biases patches or tokens to encode information at

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Information Technology (0.92)
- Energy > Power Industry (0.41)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Natural Language > Large Language Model (0.67)
    - Machine Learning > Neural Networks
      - Deep Learning (0.93)
