A Shortcut-aware Video-QA Benchmark for Physical Understanding via Minimal Video Pairs
