AITopics | Agents

Collaborating Authors

Agents

News Overviews Instructional Materials AI-Alerts Classics

In most cases, the game designer is expected to first learn about the agents

Neural Information Processing SystemsNov-15-2025, 06:22:21 GMT

We would like to thank all reviewers for reading our paper and providing constructive comments. Sometimes, the primary interest is to understand agent behaviors, and hence only the learning mode is needed. Alternatively, when all game inputs are known, the focus is on the intervention mode. In the final version, we will (i) explain in 2.1 how these We agree that it is neither rigorous nor necessary to assert that "most" Our work is inspired by the current interests on complex optimization-based layers. It is the first to treat VIs as individual layers in the end-to-end framework.

designer, game designer, vi problem, (16 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games > Computer Games (0.41)

Technology:

Information Technology > Artificial Intelligence > Games > Computer Games (0.41)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.35)

Add feedback

PerSim: Data-efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators

Neural Information Processing SystemsNov-15-2025, 06:18:18 GMT

We perform extensive experiments across several benchmark environments and RL methods.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.86)

Add feedback

Communication Efficient Distributed Learning for Kernelized Contextual Bandits

Neural Information Processing SystemsNov-15-2025, 05:56:24 GMT

Hilbert space (RKHS), i.e., the expected reward is linear w.r.t. an action feature map of possibly

algorithm, bandit, synchronization, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
North America > United States > Oregon (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Add feedback

Decentralized Q-Learning in Zero-sum Markov Games

Neural Information Processing SystemsNov-15-2025, 05:32:39 GMT

We study multi-agent reinforcement learning (MARL) in infinite-horizon discounted zero-sum Markov games.

algorithm, assumption 2, markov game, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.04)

Genre: Overview (0.68)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.67)

Add feedback

8d9a6e908ed2b731fb96151d9bb94d49-Supplemental.pdf

Neural Information Processing SystemsNov-15-2025, 01:24:06 GMT

agent, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.31)

Add feedback

8caa38721906c1a0bb95c80fab33a893-Supplemental.pdf

Neural Information Processing SystemsNov-15-2025, 01:06:22 GMT

V100 GPUs to train the models. Consortium and are licensed under a Creative Commons Attribution 4.0 License. Similarly, for evaluating the agent listener with a human speaker, each agent evaluates 400 human utterances in Fig 5b. In Fig 10, we present the results of the human evaluation on the text game. Sec 4.3, we show that agents trained using our method beat all prior baselines when paired with both The blue bars show the standard deviation across all agents present in the buffer.

artificial intelligence, machine learning, utterance, (19 more...)

Neural Information Processing Systems

Industry: Information Technology (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.55)

Add feedback

7ed2d3454c5eea71148b11d0c25104ff-Supplemental.pdf

Neural Information Processing SystemsNov-14-2025, 17:21:41 GMT

agent, river tile, sequence, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Texas (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.68)
Information Technology > Game Theory (0.68)

Add feedback

7ed2d3454c5eea71148b11d0c25104ff-Paper.pdf

Neural Information Processing SystemsNov-14-2025, 17:21:37 GMT

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
Asia > Middle East > Jordan (0.04)

Industry: Leisure & Entertainment > Games > Computer Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Add feedback

Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning Hao Ma

Neural Information Processing SystemsNov-14-2025, 07:06:48 GMT

Reinforcement learning (RL) has emerged as a pivotal technique for fine-tuning large language models (LLMs) on specific tasks. However, prevailing RL fine-tuning methods predominantly rely on PPO and its variants. Though these algorithms are effective in general RL settings, they often exhibit suboptimal performance and vulnerability to distribution collapse when applied to the fine-tuning of LLMs.

kl divergence, large language model, machine learning, (20 more...)

Neural Information Processing Systems

Country: