ConfGuard: A Simple and Effective Backdoor Detection for Large Language Models
