Automatically Auditing Large Language Models via Discrete Optimization

Open in new window