Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach