QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation