Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning