Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution