HeavyWaterand SimplexWater: Distortion-free LLM Watermarks for Low-Entropy Distributions
–Neural Information Processing Systems
Large language model (LLM) watermarks enable authentication of text provenance, curb misuse of machine-generated text, and promote trust in AI systems. Current watermarks operate by changing the next-token predictions output by an LLM. The updated (i.e., watermarked) predictions depend on random side information produced, for example, by hashing previously generated tokens. LLM watermarking is particularly challenging when next-token predictions are near-deterministic. In fact, over 90% of next-token distributions are low-entropy, with more than half of the probability mass on a single token.
Neural Information Processing Systems
Jun-20-2026, 10:17:29 GMT
- Country:
- North America > United States (0.67)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Technology: