Active Attacks: Red-teaming LLMs via Adaptive Environments
