Reinforcement Learning with Convex Constraints
Neural Information Processing Systems
In standard reinforcement learning (RL), a learning agent seeks to optimize the overall reward. However, many key aspects of a desired behavior are more naturally expressed as constraints.