Meta-trained agents implement Bayes-optimal agents

Open in new window