Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization

Open in new window