Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation
–Neural Information Processing Systems
Large language models (LLMs), despite their impressive performance across a wide range of tasks, often struggle to balance two competing objectives in open-ended text generation: fostering diversity and creativity while preserving logical coherence. Existing truncated sampling techniques, including temperature scaling, top- (nucleus) sampling, and min-sampling, aim to manage this trade-off.
Neural Information Processing Systems
Jun-11-2026, 08:07:34 GMT
- Technology: