DiPmark: A Stealthy, Efficient and Resilient Watermark for Large Language Models

Wu, Yihan, Hu, Zhengmian, Zhang, Hongyang, Huang, Heng

Oct-11-2023–arXiv.org Artificial Intelligence

Watermarking techniques offer a promising way to secure data via embedding covert information into the data. A paramount challenge in the domain lies in preserving the distribution of original data during watermarking. Our research extends and refines existing watermarking framework, placing emphasis on the importance of a distribution-preserving (DiP) watermark. Contrary to the current strategies, our proposed DiPmark preserves the original token distribution during watermarking (stealthy), is detectable without access to the language model API or weights (efficient), and is robust to moderate changes of tokens (resilient). This is achieved by incorporating a novel reweight strategy, combined with a hash function that assigns unique \textit{i.i.d.} ciphers based on the context. The empirical benchmarks of our approach underscore its stealthiness, efficiency, and resilience, making it a robust solution for watermarking tasks that demand impeccable quality preservation.

efficient and resilient watermark, language model, stealthy, (1 more...)

arXiv.org Artificial Intelligence

Oct-11-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.40)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence > Natural Language
    - Large Language Model (0.40)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found