How OpenAI stress-tests its large language models
