Best-of-N through the Smoothing Lens: KL Divergence and Regret Analysis

Open in new window