AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

Adversarially Robust Decision Transformer

Neural Information Processing SystemsFeb-10-2026, 23:48:18 GMT

However, in adversarial environments, these methods can be non-robust, since the return is dependent on the strategies of both the decision-maker and adversary. Training a probabilistic model conditioned on observed return to predict action can fail to generalize, as the trajectories that achieve a return in the dataset might have done so due to a suboptimal behavior adversary.

machine learning, natural language, reinforcement learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (0.93)

Industry:

Leisure & Entertainment > Games (0.68)
Information Technology (0.67)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
(3 more...)

Add feedback

c058f544c737782deacefa532d9add4c-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 23:48:00 GMT

algorithm, differential q-learning, formulation, (16 more...)

Neural Information Processing Systems

Country: North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.74)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

ec3183a7f107d1b8dbb90cb3c01ea7d5-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 23:47:44 GMT

agent, algorithm, training task, (12 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > West Yorkshire > Leeds (0.04)
North America > United States (0.04)
North America > Canada (0.04)

Industry: Energy (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Model-basedSafeDeepReinforcementLearningviaa ConstrainedProximalPolicyOptimizationAlgorithm

Neural Information Processing SystemsFeb-10-2026, 23:46:42 GMT

During initial iterations of training in most Reinforcement Learning (RL) algorithms, agents perform asignificant number ofrandom exploratory steps.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: Asia > India > Karnataka > Bengaluru (0.05)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)

Add feedback

DesignofExperiments forStochasticContextualLinearBandits

Neural Information Processing SystemsFeb-10-2026, 23:38:06 GMT

In the stochastic linear contextual bandit setting there exist several minimax procedures for exploration with policies that are reactive to the data being acquired.

data mining, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Stanford (0.05)
North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > United States > California > Alameda County > Berkeley (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

ProximalPointImitationLearning

Neural Information Processing SystemsFeb-10-2026, 23:25:57 GMT

Toour knowledge, such guarantees in this setting are providedforthefirsttime.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Switzerland > Vaud > Lausanne (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

ProximalPointImitationLearning

Neural Information Processing SystemsFeb-10-2026, 23:25:53 GMT

Toour knowledge, such guarantees in this setting are providedforthefirsttime.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Switzerland > Vaud > Lausanne (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

TheSensoryNeuronasaTransformer: Permutation-InvariantNeuralNetworksfor ReinforcementLearning

Neural Information Processing SystemsFeb-10-2026, 23:13:46 GMT

In complex systems, we often observe complex global behavior emerge from a collection of agents interacting with each other in their environment, with each individual agent acting only on locally available information, without knowing thefullpicture.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Technology: