AITopics | Agents

Despite some success in the single-agent setting, offline multi-agent RL (MARL) remains to be a challenge. The large joint state-action space and the coupled multi-agent behaviors pose extra complexities for offline policy optimization.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

a3621ee907def47c1b952ade25c67698-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 03:27:01 GMT

arxiv preprint arxiv, large language model, machine learning, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > Dominican Republic (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Leisure & Entertainment > Games (0.67)
Banking & Finance > Trading (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(3 more...)

Add feedback

Conservative Offline Policy Adaptation in Multi-Agent Games

Neural Information Processing SystemsOct-9-2025, 03:26:17 GMT

We prove that CSP learns a near-optimal risk-free offline adaptation policy upon convergence.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Virginia > Arlington County > Arlington (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A Definition of Continual Reinforcement Learning

Neural Information Processing SystemsOct-9-2025, 02:46:13 GMT

In a standard view of the reinforcement learning problem, an agent's goal is to efficiently identify a policy that maximizes long-term reward.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Barbados (0.04)

Industry: Education (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.94)

Add feedback

On the Interplay between Social Welfare and Tractability of Equilibria

Neural Information Processing SystemsOct-9-2025, 02:38:34 GMT

Nash equilibria can be effectively addressed.

artificial intelligence, equilibria, machine learning, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Mathematics of Computing (0.93)

Add feedback

Mechanism Design for Collaborative Normal Mean Estimation

Neural Information Processing SystemsOct-9-2025, 02:29:15 GMT

However, simply pooling everyone's data and sharing with each other can lead to free-riding [

agent, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Spain > Galicia > Madrid (0.04)
Europe > Spain > Andalusia > Granada Province > Granada (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Health & Medicine > Therapeutic Area (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)

Add feedback

Filters

Collaborating Authors

Agents

a7a7c0c92f195cce85f99768621ac6c0-Paper-Datasets_and_Benchmarks.pdf

a6d7226db2ff3643d8624624e3859c19-Paper-Conference.pdf

a5e4907a40c0dcb8433a35c714ba9d79-Paper-Conference.pdf

a5357781c204d4412e44ed9cbcdb08d5-Paper-Conference.pdf

Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization Xiangsen Wang

a3621ee907def47c1b952ade25c67698-Paper-Conference.pdf

Conservative Offline Policy Adaptation in Multi-Agent Games

A Definition of Continual Reinforcement Learning

On the Interplay between Social Welfare and Tractability of Equilibria

Mechanism Design for Collaborative Normal Mean Estimation