Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution
