SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders

Jun-14-2026, 06:41:22 GMT–Neural Information Processing Systems

Watermarking LLM-generated text is critical for content attribution and misinformation prevention, yet existing methods compromise text quality and require white-box model access with logit manipulation or training, which exclude API-based models and multilingual scenarios.

artificial intelligence, large language model, natural language, (6 more...)

Neural Information Processing Systems

Jun-14-2026, 06:41:22 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.35)