No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices
–Neural Information Processing Systems
Advances in generative models have made it possible for AI-generated text, code, and images to mirror human-generated content in many applications. W atermark-ing, a technique that aims to embed information in the output of a model to verify its source, is useful for mitigating the misuse of such AI-generated content. However, we show that common design choices in LLM watermarking schemes make the resulting systems surprisingly susceptible to attack--leading to fundamental trade-offs in robustness, utility, and usability. To navigate these trade-offs, we rigorously study a set of simple yet effective attacks on common watermarking systems, and propose guidelines and defenses for LLM watermarking in practice.
Neural Information Processing Systems
Feb-18-2026, 19:07:13 GMT
- Country:
- Asia > Myanmar
- Tanintharyi Region > Dawei (0.04)
- North America
- Jamaica (0.04)
- United States
- California > Santa Barbara County
- Santa Barbara (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Virginia (0.04)
- California > Santa Barbara County
- Asia > Myanmar
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Technology: