AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

Learning Retrospective Knowledge with Reverse Reinforcement Learning Shangtong Zhang University of Oxford Vivek V eeriah University of Michigan, Ann Arbor Shimon Whiteson University of Oxford

Neural Information Processing SystemsAug-17-2025, 01:40:27 GMT

We present a Reverse Reinforcement Learning (Reverse RL) approach for representing retrospective knowledge .

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan (0.86)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.76)

Industry:

Transportation > Ground > Road (0.46)
Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

935151cc6cb5d8b6816133b75233775a-Paper-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 01:29:49 GMT

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)

Add feedback

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

Neural Information Processing SystemsAug-17-2025, 01:29:18 GMT

Human explanation (e.g., in terms of feature importance) has been recently used

explanation, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > Arizona > Maricopa County > Tempe (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(2 more...)

Genre: Research Report (0.93)

Industry:

Education (0.46)
Transportation (0.30)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Joint Policy Search for Multi-agent Collaboration with Imperfect Information

Neural Information Processing SystemsAug-17-2025, 01:28:52 GMT

To learn good joint policies for multi-agent collaboration with imperfect information remains a fundamental challenge.

infoset, joint policy search, policy change, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada (0.04)

Industry: Leisure & Entertainment > Games > Bridge (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Games (1.00)

Add feedback

Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes

Neural Information Processing SystemsAug-17-2025, 01:18:58 GMT

We study minimax optimal reinforcement learning in episodic factored Markov decision processes (FMDPs), which are MDPs with conditionally independent transition components.

algorithm, factored structure, fmdp, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.85)

Add feedback

Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models Minting Pan Xiangming Zhu Y unbo Wang

Neural Information Processing SystemsAug-17-2025, 01:18:40 GMT

World models learn the consequences of actions in vision-based interactive systems. However, in practical scenarios such as autonomous driving, there commonly exists noncontrollable dynamics independent of the action signals, making it difficult to learn effective world models.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Transportation > Ground > Road (0.35)
Information Technology > Robotics & Automation (0.35)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
(2 more...)

Add feedback

Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives

Neural Information Processing SystemsAug-17-2025, 01:18:29 GMT

These parameterized primitives are expressive, simple to implement, enable efficient exploration and can be transferred across robots, tasks and environments.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre: Research Report > New Finding (0.46)

Technology: