AI Control: Improving Safety Despite Intentional Subversion
