Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols
