HENASY: Learning to Assemble Scene-Entities for Interpretable Egocentric Video-Language Model
–Neural Information Processing Systems
In this paper, we take an inspiration from human perception and explore a compositional approach for egocentric video representation.
Neural Information Processing Systems
Oct-10-2025, 11:21:24 GMT
- Country:
- Europe > Switzerland (0.04)
- North America > United States
- Arkansas > Washington County > Fayetteville (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education (0.46)
- Technology: