Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale

Open in new window