On Training Flexible Robots using Deep Reinforcement Learning