Test-Time Matching: Unlocking Compositional Reasoning in Multimodal Models
