Gradient Methods for Online DR-Submodular Maximization with Stochastic Long-Term Constraints

Mar-19-2026, 02:35:20 GMT–Neural Information Processing Systems

In this paper, we consider the problem of online monotone DR-submodular maximization subject to long-term stochastic constraints. Specifically, at each round $t\in [T]$, after committing an action $\mathbf{x}_t$, a random reward $f_t(\mathbf{x}_t)$ and an unbiased gradient estimate of the point $\widetilde{\nabla}f_t(\mathbf{x}_t)$ (semi-bandit feedback) are revealed. Meanwhile, a budget of $g_t(\mathbf{x}_t)$, which is linear and stochastic, is consumed of its total allotted budget $B_T$.

artificial intelligence, constraint-based reasoning, proceedings, (8 more...)

Neural Information Processing Systems

Mar-19-2026, 02:35:20 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.43)