Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs
