Cascade Reward Sampling for Efficient Decoding-Time Alignment
