Watermarking Makes Language Models Radioactive Tom Sander
–Neural Information Processing Systems
Current methods like membership inference or active IP protection either work only in settings where the suspected text is known or do not provide reliable statistical guarantees. We discover that, on the contrary, it is possible to reliably determine if a language model was trained on synthetic data if that data is output by a watermarked LLM.
Neural Information Processing Systems
Aug-15-2025, 22:16:02 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Energy > Oil & Gas (1.00)
- Law (0.93)
- Technology: