Foundations of Top-$k$ Decoding For Language Models
