Enhancing LLM Watermark Resilience Against Both Scrubbing and Spoofing Attacks

Jun-11-2026, 07:52:22 GMT–Neural Information Processing Systems

Watermarking is widely regarded as a promising defense against the misuse of large language models (LLMs); however, existing methods are fundamentally constrained by their vulnerability to scrubbing and spoofing attacks. This vulnerability stems from an inherent trade-off governed by watermark window size: smaller windows resist scrubbing better but are easier to reverse-engineer, enabling low-cost statistics-based spoofing attacks. This work expands the trade-off boundary by introducing a novel mechanism, equivalent texture keys, where multiple tokens within a watermark window can independently support the detection.

large language model, natural language, proceedings, (3 more...)

Neural Information Processing Systems

Jun-11-2026, 07:52:22 GMT

Conferences Web Page

Add feedback

Industry:
- Information Technology > Security & Privacy (0.90)

Technology:
- Information Technology
  - Security & Privacy (0.90)
  - Artificial Intelligence > Natural Language
    - Large Language Model (0.66)