Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
