AITopics | Reinforcement Learning


Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.
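The trial-and-error idea in the quotation can be illustrated with a small sketch. The example below uses epsilon-greedy action selection, a standard technique that the quotation itself does not name; the function name `epsilon_greedy`, the callback `reward_fn`, and all parameter values are assumptions made for illustration only.

```python
import random


def epsilon_greedy(reward_fn, n_actions, steps=500, epsilon=0.1, seed=0):
    """Estimate action values by trial and error; return the estimates.

    reward_fn is a hypothetical callback mapping an action index to a
    numerical reward. The learner is never told which action is best.
    """
    rng = random.Random(seed)
    values = [0.0] * n_actions  # running mean reward per action
    counts = [0] * n_actions    # number of times each action was tried
    for t in range(steps):
        if t < n_actions:
            action = t  # try every action once before anything else
        elif rng.random() < epsilon:
            action = rng.randrange(n_actions)  # explore a random action
        else:
            # exploit the action with the highest current estimate
            action = max(range(n_actions), key=values.__getitem__)
        reward = reward_fn(action)
        counts[action] += 1
        # incremental update of the mean reward for this action
        values[action] += (reward - values[action]) / counts[action]
    return values
```

Calling `epsilon_greedy(lambda a: [0.1, 0.9, 0.5][a], 3)` with this toy reward table returns estimates whose largest entry belongs to action 1, which the learner found only by trying each action and observing the reward.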


440924c5948e05070663f88e69e8242b-Paper.pdf

Neural Information Processing Systems, Feb-8-2026, 06:04:43 GMT

algorithm, arxiv preprint arxiv, complexity, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.43)


Sim and Real: Better Together

Neural Information Processing Systems, Feb-8-2026, 05:44:52 GMT

We achieve that by maintaining a replay buffer for each environment the agent interacts with.
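Keeping one replay buffer per environment might look roughly like the sketch below. It is not the paper's implementation: the class name `PerEnvReplay`, the environment names "sim" and "real", the buffer capacity, and the weighted mixing rule are all assumptions made for illustration.

```python
import random
from collections import deque


class PerEnvReplay:
    """One bounded FIFO replay buffer per environment (hypothetical helper)."""

    def __init__(self, env_names, capacity=10_000, seed=0):
        # A separate deque per environment; the oldest transitions are
        # dropped automatically once a buffer reaches capacity.
        self.buffers = {name: deque(maxlen=capacity) for name in env_names}
        self.rng = random.Random(seed)

    def add(self, env_name, transition):
        """Store a transition in the buffer of the environment it came from."""
        self.buffers[env_name].append(transition)

    def sample(self, batch_size, weights=None):
        """Draw a mixed batch; `weights` maps env name -> sampling share.

        With weights=None every non-empty buffer is equally likely.
        """
        names = [n for n, b in self.buffers.items() if b]
        if not names:
            raise ValueError("all buffers are empty")
        w = [1.0 if weights is None else weights.get(n, 0.0) for n in names]
        picks = self.rng.choices(names, weights=w, k=batch_size)
        return [(n, self.rng.choice(self.buffers[n])) for n in picks]
```

A caller would `add` each transition under the name of the environment that produced it, then call `sample` with weights such as `{"sim": 0.5, "real": 0.5}` to control how much of each training batch comes from each source.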

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel > Haifa District > Haifa (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)


Explainability Via Causal Self-Talk

Neural Information Processing Systems, Feb-8-2026, 05:36:40 GMT

As modern machine learning systems become more powerful and embedded in our lives, the need to have these systems explain their behavior becomes increasingly urgent.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > California > Alameda County > Livermore (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > France (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.30)


43207fd5e34f87c48d584fc5c11befb8-Paper.pdf

Neural Information Processing Systems, Feb-8-2026, 05:35:52 GMT

algorithm, mdp, sample complexity, (11 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > California (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Genre: Research Report (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


322e4a595afd9442a89f0bfaa441871e-Paper-Conference.pdf

Neural Information Processing Systems, Feb-8-2026, 05:34:33 GMT

dataset, demonstration, task-specific dataset, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Illinois > Champaign County > Urbana (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning

Neural Information Processing Systems, Feb-8-2026, 05:25:51 GMT

Theorem 1. Let n 9. Under assumption (A2) with 1 < 1, there exist a positive real number independent of n such that, for 0 , (a) Using = 0n Also, simple computations show that V is an affine transform of r: V (x) = ar(x) + b, with a = ( 1 (1 ")) 1 and b = a We also acknowledge support from the European Research Council (gran...

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


Regret Bounds of Concurrent Thompson Sampling

陈琰

Neural Information Processing Systems, Feb-8-2026, 05:25:34 GMT

agent, algorithm, learning, (13 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.65)


Appendix A: Visual Reinforcement Learning Baselines. DrQ: This model-free, off-policy reinforcement learning algorithm is based on Soft Actor-Critic (SAC) [

Neural Information Processing Systems, Feb-8-2026, 05:24:54 GMT

Meanwhile, we utilize the 3D scenes from the Gibson dataset as our map for all experiments. Autonomous driving: We choose the stable version of CARLA 0.9.10 for simulation.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: Europe > Sweden > Skåne County > Malmö (0.04)

Industry: Transportation > Ground > Road (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization

Neural Information Processing Systems, Feb-8-2026, 05:24:51 GMT

Visual Reinforcement Learning (Visual RL), coupled with high-dimensional observations, has consistently confronted the long-standing challenge of out-of-distribution generalization.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Skåne County > Malmö (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre:

Research Report (0.46)
Overview (0.46)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


Grounded Reinforcement Learning: Learning to Win the Game under Human Commands, Supplementary Materials

Neural Information Processing Systems, Feb-8-2026, 05:06:43 GMT

In this section, we describe the details of the MiniRTS environment and the human dataset. The data do not contain any personally identifiable information or offensive content. Figure 1: MiniRTS [2] implements the rock-paper-scissors attack graph; each army type has some units it is effective against and some it is vulnerable to. "swordman", "spearman" and "cavalry" are all effective against "archer". Figure 2: Building units can produce different army units using resources. Resource Units: Resource units are stationary and neutral.

catapult, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Industry: Government > Military > Army (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.50)
