M3PO: Multimodal-Model-Guided Preference Optimization for Visual Instruction Following

Open in new window