Watermarking Makes Language Models Radioactive

Tom Sander

Aug-15-2025, 22:16:02 GMT–Neural Information Processing Systems

Current methods for detecting whether a language model was trained on synthetic text, such as membership inference or active IP protection, either work only in settings where the suspected text is known or do not provide reliable statistical guarantees. We discover that, on the contrary, it is possible to reliably determine whether a language model was trained on synthetic data when that data was output by a watermarked LLM.
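
To make "reliable statistical guarantees" concrete, here is a minimal sketch of the kind of hypothesis test a watermark detector can run on text produced by a suspect model. It assumes a green-list style watermark in which a secret key and the previous token mark a fraction `gamma` of the vocabulary as "green"; the summary above does not say which watermarking scheme or test the paper uses, so the hash-based `is_green` rule, the `key`, the value of `GAMMA`, and all function names are hypothetical and chosen only for illustration.

```python
import hashlib
from math import comb

GAMMA = 0.25  # assumed fraction of the vocabulary marked "green" at each step


def is_green(prev_token: int, token: int, key: int = 42, gamma: float = GAMMA) -> bool:
    """Hypothetical keyed hash deciding whether `token` is green after `prev_token`."""
    digest = hashlib.sha256(f"{key}:{prev_token}:{token}".encode()).digest()
    return int.from_bytes(digest[:8], "big") / 2**64 < gamma


def binomial_p_value(green: int, total: int, gamma: float = GAMMA) -> float:
    """One-sided p-value: P(X >= green) for X ~ Binomial(total, gamma)."""
    return sum(
        comb(total, k) * gamma**k * (1 - gamma) ** (total - k)
        for k in range(green, total + 1)
    )


def radioactivity_p_value(tokens: list[int], key: int = 42, gamma: float = GAMMA) -> float:
    """Count green tokens over distinct (previous, current) pairs and test the count.

    Each distinct pair is scored once, so repeated phrases do not inflate
    the green count (an assumption of this sketch).
    """
    seen = set()
    green = total = 0
    for prev, tok in zip(tokens, tokens[1:]):
        if (prev, tok) in seen:
            continue
        seen.add((prev, tok))
        total += 1
        green += is_green(prev, tok, key, gamma)
    return binomial_p_value(green, total, gamma)
```

Under the null hypothesis that the suspect model never saw watermarked data, each scored token is green with probability `gamma`, so a very small p-value is evidence against that hypothesis. The p-value is what bounds the false-positive rate; a heuristic similarity score would not give that guarantee.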

large language model, machine learning, natural language

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Energy > Oil & Gas (1.00)
- Law (0.93)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (0.93)
    - Machine Learning
      - Neural Networks > Deep Learning (0.68)
      - Performance Analysis > Accuracy (0.45)
