RepliBench: Evaluating the Autonomous Replication Capabilities of Language Model Agents

Open in new window