Curiosity-driven Red-teaming for Large Language Models
