Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models