AITopics | Agents

f0eb6568ea114ba6e293f903c34d7488-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 01:42:15 GMT

Several works haveshown this vulnerability via adversarial attacks, butexisting approaches onimproving therobustness ofDRL under this setting have limited success and lack for theoretical principles. We show that naively applying existing techniques on improving robustness for classification tasks,likeadversarialtraining,areineffectiveformanyRLtasks.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Information Technology (0.49)
Government > Military (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

c3e0c62ee91db8dc7382bde7419bb573-Supplemental.pdf

Neural Information Processing SystemsFeb-11-2026, 01:41:15 GMT

Theactiveagent trains (as a regular Double-DQN) up to the time of forking, at which point the passive agent is created asa'fork' (i.e.,with identical networkweights) oftheactiveagent.

artificial intelligence, experiment, machine learning, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.35)

Add feedback

9d823334fdccb62a544fa7643cf0615d-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 01:01:33 GMT

equilibria, equilibrium, mediator, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
North America > United States > Florida > Hillsborough County > Tampa (0.04)
(4 more...)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

Add feedback

9c7008aff45b5d8f0973b23e1a22ada0-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 00:28:13 GMT

arxiv preprint arxiv, dataset, foundation model, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.49)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

TheSurprisingEffectivenessofPPOinCooperative Multi-AgentGames

Neural Information Processing SystemsFeb-11-2026, 00:26:28 GMT

Inthiswork, we carefully study the performance of PPO in cooperative multi-agent settings.

agent, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Asia > China > Beijing > Beijing (0.04)
Africa > Ethiopia (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Learning in Non-Cooperative Configurable Markov Decision Processes Giorgia Ramponi ETH AI Center Zurich, Switzerland gramponi@ethz.ch Alberto Maria Metelli Politecnico di Milano Milan, Italy

Neural Information Processing SystemsFeb-11-2026, 00:05:40 GMT

Reinforcement Learning agent and a configurator that can modify some environmental parameters to improve the agent's performance.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.86)
Europe > Italy > Lombardy > Milan (0.40)
North America > United States (0.14)
(2 more...)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.64)

Add feedback

Adversarially Robust Decision Transformer

Neural Information Processing SystemsFeb-10-2026, 23:48:18 GMT

However, in adversarial environments, these methods can be non-robust, since the return is dependent on the strategies of both the decision-maker and adversary. Training a probabilistic model conditioned on observed return to predict action can fail to generalize, as the trajectories that achieve a return in the dataset might have done so due to a suboptimal behavior adversary.

machine learning, natural language, reinforcement learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Industry:

Leisure & Entertainment > Games (0.68)
Information Technology (0.67)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
(3 more...)

Add feedback

c058f544c737782deacefa532d9add4c-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 23:48:00 GMT

algorithm, differential q-learning, formulation, (16 more...)

Neural Information Processing Systems

Country: North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.74)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

Calibrating " Cheap Signals " in Peer Review without a Prior

Neural Information Processing SystemsFeb-10-2026, 23:45:40 GMT

Detecting and correcting bias is challenging, as ratings are subjective and unverifiable. Unlike previous works relying on prior knowledge or historical data, we propose a one-shot noise calibration process without any prior information.

artificial intelligence, probability, reviewer, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

TheSensoryNeuronasaTransformer: Permutation-InvariantNeuralNetworksfor ReinforcementLearning

Neural Information Processing SystemsFeb-10-2026, 23:13:46 GMT

In complex systems, we often observe complex global behavior emerge from a collection of agents interacting with each other in their environment, with each individual agent acting only on locally available information, without knowing thefullpicture.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.83)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.54)

Add feedback

Filters

Collaborating Authors

Agents

f0eb6568ea114ba6e293f903c34d7488-Paper.pdf

c3e0c62ee91db8dc7382bde7419bb573-Supplemental.pdf

9d823334fdccb62a544fa7643cf0615d-Paper-Conference.pdf

9c7008aff45b5d8f0973b23e1a22ada0-Paper-Conference.pdf

TheSurprisingEffectivenessofPPOinCooperative Multi-AgentGames

Learning in Non-Cooperative Configurable Markov Decision Processes Giorgia Ramponi ETH AI Center Zurich, Switzerland gramponi@ethz.ch Alberto Maria Metelli Politecnico di Milano Milan, Italy

Adversarially Robust Decision Transformer

c058f544c737782deacefa532d9add4c-Paper.pdf

Calibrating " Cheap Signals " in Peer Review without a Prior

TheSensoryNeuronasaTransformer: Permutation-InvariantNeuralNetworksfor ReinforcementLearning