AITopics | primal-dual algorithm

In contrast to the advances in characterizing the sample complexity for solving Markov decision processes (MDPs), the optimal statistical complexity for solving constrained MDPs (CMDPs) remains unknown. We resolve this question by providing minimax upper and lower bounds on the sample complexity for learning near-optimal policies in a discounted CMDP with access to a generative model (simulator). In particular, we design a model-based algorithm that addresses two settings: (i) relaxed feasibility, where small constraint violations are allowed, and (ii) strict feasibility, where the output policy is required to satisfy the constraint.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.46)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)

047397849f63b4fcfced4ff720159f3d-Paper-Conference.pdf

Neural Information Processing Systems | Apr-24-2026, 06:51:56 GMT

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Industry: Banking & Finance (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.97)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

A primal-dual method for conic constrained distributed optimization problems

Necdet Serhat Aybat, Erfan Yazdandoost Hamedani

Neural Information Processing Systems | Mar-23-2026, 12:42:45 GMT

We consider cooperative multi-agent consensus optimization problems over an undirected network of agents, where only those agents connected by an edge can directly communicate. The objective is to minimize the sum of agent-specific composite convex functions over agent-specific private conic constraint sets; hence, the optimal consensus decision should lie in the intersection of these private sets. We provide convergence rates in sub-optimality, infeasibility, and consensus violation; examine the effect of underlying network topology on the convergence rates of the proposed decentralized algorithms; and show how to extend these methods to handle time-varying communication networks.

algorithm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.46)
Europe (0.28)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Stochastic Variance Reduced Primal Dual Algorithms for Empirical Composition Optimization

Adithya M Devraj, Jianshu Chen

Neural Information Processing Systems | Feb-11-2026, 17:55:42 GMT

We exploit the rich structures of the reformulated problem and develop a stochastic primal-dual algorithm, SVRPDA-I, to solve the problem efficiently.

algorithm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Bellevue (0.05)
North America > United States > Florida > Alachua County > Gainesville (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

a3f8f584febcc88ed8cdeb30b096db34-Paper-Conference.pdf

Neural Information Processing Systems | Feb-11-2026, 03:13:07 GMT

algorithm, constraint, markov game, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.67)

XXXXX

XXX

Neural Information Processing Systems | Feb-7-2026, 14:34:13 GMT

Bandits, Reinforcement Learning

algorithm, probability, value function, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > Canada > Alberta (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

XXXXX

XXX

Neural Information Processing Systems | Feb-7-2026, 14:34:09 GMT

There have been multiple recent approaches to obtain a near-optimal policy in CMDPs in the regret-minimization or PAC-RL settings [13, 38, 9, 19, 31, 22, 36, 12, 15, 16, 11].

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)

5616060fb8ae85d93f334e7267307664-Paper.pdf

310b60949d2b6096903d7e8a539b20f5-Paper.pdf
