Reinforcement Learning Controllers for Soft Robots using Learned Environments