Redeeming Intrinsic Rewards via Constrained Optimization