Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods

Open in new window