SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF

Open in new window