End-to-endMulti-modalVideoTemporalGrounding
–Neural Information Processing Systems
To integrate the three modalities more effectively and enable inter-modal learning, we design a dynamic fusion scheme with transformers to model the interactions between modalities.
Neural Information Processing Systems
Feb-11-2026, 19:56:39 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Vision (0.69)
- Information Technology > Artificial Intelligence