Scaling up budgeted reinforcement learning