AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

News Overviews Instructional Materials AI-Alerts Classics

A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning

Nicolas Carion, Nicolas Usunier, Gabriel Synnaeve, Alessandro Lazaric

Neural Information Processing SystemsOct-2-2025, 14:07:55 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.51)

2e1b24a664f5e9c18f407b2f9c73e821-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 14:02:20 GMT

artificial intelligence, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.46)
North America > United States (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Reviewer 1: Q1: I wonder if their analysis tricks of AC/NAC when applied to PG methods improve their guarantees

Neural Information Processing SystemsOct-2-2025, 14:02:10 GMT

If their analysis tricks do improve PG guarantees, how does it compare then? Reviewer 2: Q1: It would be interesting to complement the theoretical results with empirical results in toy problem. We are working on experiments and will add these results to the revision. Q2: For the error term that disappears with a larger mini-batch (line 211). A2: Y es, this error term should be called as variance error.

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.30)

Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards

Neural Information Processing SystemsOct-2-2025, 13:57:57 GMT

Recent work demonstrated that using a memory buffer of previous successful trajectories can result in more effective policies. However, existing methods may overly exploit past successful experiences, which can encourage the agent to adopt sub-optimal and myopic behaviors.

machine learning, reinforcement learning, trajectory, (15 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Mapping State Space using Landmarks for Universal Goal Reaching

Zhiao Huang, Fangchen Liu, Hao Su

Neural Information Processing SystemsOct-2-2025, 13:56:49 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: North America (0.14)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

2cd4e8a2ce081c3d7c32c3cde4312ef7-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 13:51:41 GMT

machine learning, natural language, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.74)
Information Technology > Artificial Intelligence > Natural Language (0.68)

Munchausen Reinforcement Learning

Neural Information Processing SystemsOct-2-2025, 13:43:16 GMT

Bootstrapping is a core mechanism in Reinforcement Learning (RL).

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: Europe (0.46)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

2c6a0bae0f071cbbf0bb3d5b11d90a82-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 13:43:09 GMT

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

Europe (0.46)
North America (0.28)

Industry: Leisure & Entertainment (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

2be5f9c2e3620eb73c2972d7552b6cb5-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 13:38:17 GMT

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country: Europe > Netherlands (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)