Generative RLHF-V: Learning Principles from Multi-modal Human Preference

Open in new window