End-to-end grasping policies for human-in-the-loop robots via deep reinforcement learning