Causal Reasoning from Meta-reinforcement Learning