WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Tu, Shangqing, Sun, Yuliang, Bai, Yushi, Yu, Jifan, Hou, Lei, Li, Juanzi
arXiv.org Artificial Intelligence
To mitigate the potential misuse of large language models (LLMs), recent research has developed watermarking algorithms, which restrict the generation process to leave an invisible trace for watermark detection. Because the task has two stages, most studies evaluate generation and detection separately, which makes unbiased, thorough, and applicable evaluation difficult. In this paper, we introduce WaterBench, the first comprehensive benchmark for LLM watermarks, built around three design factors: (1) For the benchmarking procedure, to ensure an apples-to-apples comparison, we first adjust each watermarking method's hyper-parameter to reach the same watermarking strength, then jointly evaluate their generation and detection performance. (2) For task selection, we diversify the input and output length to form a five-category taxonomy covering 9 tasks. (3) For the evaluation metric, we adopt GPT4-Judge to automatically evaluate the decline of instruction-following abilities after watermarking. We evaluate 4 open-source watermarks on 2 LLMs under 2 watermarking strengths and observe that current methods commonly struggle to maintain generation quality. The code and data are available at https://github.com/THU-KEG/WaterBench.
Nov-13-2023
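The benchmarking procedure described in the abstract (tune each method's hyper-parameter until all methods reach the same watermarking strength, then compare them) can be illustrated with a minimal sketch. It assumes that strength is a scalar that does not decrease as the hyper-parameter grows, so bisection applies. The functions `toy_strength_a` and `toy_strength_b`, the method names, and the search ranges are hypothetical stand-ins, not measurements or code from WaterBench.

```python
import math
from typing import Callable, Dict, Tuple


def calibrate(strength_fn: Callable[[float], float], target: float,
              lo: float, hi: float, tol: float = 1e-4) -> float:
    """Bisection search for the hyper-parameter whose strength is ~target.

    Assumes strength_fn is non-decreasing on [lo, hi] and that
    strength_fn(lo) <= target <= strength_fn(hi).
    """
    if not (strength_fn(lo) <= target <= strength_fn(hi)):
        raise ValueError("target strength is not reachable on [lo, hi]")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if strength_fn(mid) < target:
            lo = mid  # too weak: move the lower bound up
        else:
            hi = mid  # strong enough: move the upper bound down
    return (lo + hi) / 2


# Hypothetical strength curves. In a real benchmark these would run the
# watermark and its detector on held-out prompts and return a measured rate.
def toy_strength_a(param: float) -> float:
    return 1.0 - math.exp(-param)


def toy_strength_b(param: float) -> float:
    return param / (1.0 + param)


# name -> (strength function, lower search bound, upper search bound)
METHODS: Dict[str, Tuple[Callable[[float], float], float, float]] = {
    "method_a": (toy_strength_a, 0.0, 20.0),
    "method_b": (toy_strength_b, 0.0, 100.0),
}


def calibrate_all(methods, target: float) -> Dict[str, float]:
    """Tune every method to the same target strength before comparing them."""
    return {name: calibrate(fn, target, lo, hi)
            for name, (fn, lo, hi) in methods.items()}


if __name__ == "__main__":
    for name, param in calibrate_all(METHODS, target=0.95).items():
        print(f"{name}: hyper-parameter = {param:.3f}")
```

Once every method sits at the same strength, differences in generation quality can be attributed to the watermark itself rather than to how aggressively it was configured.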
- Country:
  - Africa > Ghana (0.05)
  - Asia
    - China > Beijing
      - Beijing (0.04)
    - Japan (0.04)
    - Middle East > Iraq
      - Erbil Governorate > Erbil (0.04)
    - Myanmar > Tanintharyi Region
      - Dawei (0.04)
    - Russia (0.04)
  - Europe
    - Lithuania (0.04)
    - Russia (0.04)
    - Sweden > Skåne County
      - Malmö (0.04)
    - United Kingdom
  - North America
    - Canada (0.04)
    - United States
      - Pennsylvania > Allegheny County
        - Pittsburgh (0.04)
      - Colorado > Boulder County
        - Longmont (0.04)
      - California (0.04)
      - Wyoming (0.04)
      - Missouri > Jackson County
        - Kansas City (0.04)
      - Utah > Weber County
        - Ogden (0.04)
      - North Carolina
        - Forsyth County > Winston-Salem (0.04)
        - Guilford County > Greensboro (0.04)
      - Oklahoma > Oklahoma County
        - Oklahoma City (0.04)
      - Hawaii (0.04)
      - New York (0.04)
      - Montana > Yellowstone County
        - Billings (0.04)
      - Texas (0.04)
      - Arizona > Maricopa County
        - Phoenix (0.04)
      - Nevada > Clark County
        - Las Vegas (0.04)
  - Oceania > Australia (0.04)
  - South America > Brazil
- Genre:
  - Personal (0.92)
  - Research Report > New Finding (0.46)
- Industry:
  - Information Technology > Security & Privacy (1.00)
  - Materials > Metals & Mining
    - Gold (1.00)
- Technology: