Bringing Image Structure to Video via Frame-Clip Consistency of Object Tokens
Neural Information Processing Systems
On the other hand, one does often have access to a small set of annotated images, either within or outside the domain of interest. Here we ask how such images can be leveraged for downstream video understanding tasks.
Nov-15-2025, 18:24:17 GMT
- Country:
- Asia > Middle East
- Israel > Tel Aviv District > Tel Aviv (0.04)
- North America > United States
- Utah > Salt Lake County > Salt Lake City (0.04)
- Genre:
- Research Report (0.46)
- Technology: