AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

AgentModellingunderPartialObservabilityfor DeepReinforcementLearning

Neural Information Processing SystemsFeb-10-2026, 08:00:56 GMT

Existing methods for agent modelling commonly assume knowledge of the local observations and chosen actions of the modelled agents during execution. To eliminate this assumption, we extract representations from thelocalinformation ofthecontrolled agent using encoderdecoderarchitectures.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

c7af0926b294e47e52e46cfebe173f20-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 07:33:03 GMT

agent, hyperparameter, optimization, (13 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
(2 more...)

Add feedback

80b7bec60081f95d900973509744a306-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 07:17:27 GMT

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Middlesex County > New Brunswick (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)

Add feedback

I2Q: AFullyDecentralizedQ-LearningAlgorithm

Neural Information Processing SystemsFeb-10-2026, 07:04:57 GMT

The modeling of ideal transition function inI2Q isfully decentralized and independent from the learned policies of other agents, helping I2Q be free from non-stationarity and learn the optimal policy.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

9f0609b9d45dd55bed75f892cf095fcf-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 07:04:33 GMT

arxiv preprint arxiv, learning, neural network, (13 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

9eed867b73ab1eab60583c9d4a789b1b-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 07:03:28 GMT

algorithm, denote, inequality, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.45)

Industry: Leisure & Entertainment > Games (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.93)

Add feedback

802a4350ca4fced76b13b8b320af1543-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 06:44:57 GMT

augmentation, data augmentation, srm, (13 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

SpectrumRandomMaskingforGeneralizationin Image-based ReinforcementLearning

Neural Information Processing SystemsFeb-10-2026, 06:44:53 GMT

To handle this problem, a natural approach is to increase the data diversity by image based augmentations. However, different with most vision tasks such as classification and detection, RL tasks are not always invariant to spatial based augmentations duetotheentanglement ofenvironment dynamics andvisual appearance.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

Add feedback

High-ThroughputSynchronousDeepRL

Neural Information Processing SystemsFeb-10-2026, 06:44:26 GMT

Deep reinforcement learning (RL) is computationally demanding and requiresprocessing of many data points. Synchronous methods enjoy training stability while having lowerdatathroughput.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: