HeavyWater and SimplexWater: Distortion-free LLM Watermarks for Low-Entropy Distributions
–Neural Information Processing Systems
Large language model (LLM) watermarks enable authentication of text provenance, curb misuse of machine-generated text, and promote trust in AI systems. Current watermarks operate by changing the next-token predictions output by an LLM. The updated (i.e., watermarked) predictions depend on random side information produced, for example, by hashing previously generated tokens. LLM watermarking is particularly challenging in low-entropy generation tasks -- such as coding -- where next-token predictions are near-deterministic. In this paper, we propose an optimization framework for watermark design.
Neural Information Processing Systems
Jun-13-2026, 11:26:06 GMT