AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

fc95fa5740ba01a870cfa52f671fe1e4-Supplemental.pdf

Neural Information Processing Systems | Feb-12-2026, 01:02:46 GMT

high probability, probability, sequence, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.46)

fc95fa5740ba01a870cfa52f671fe1e4-Paper.pdf

Neural Information Processing Systems | Feb-12-2026, 01:02:42 GMT

high probability, probability, sequence, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.46)

Simplifying Latent Dynamics with Softly State-Invariant World Models

Neural Information Processing Systems | Feb-12-2026, 00:59:09 GMT

This makes the world model softly state-invariant.

information, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
(2 more...)

56a225639da77e8f7c0409f6d5ba996b-Paper-Conference.pdf

Neural Information Processing Systems | Feb-12-2026, 00:56:23 GMT

machine learning, natural language, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States (0.04)
Europe > Finland (0.04)
(8 more...)

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

42c8938e4cf5777700700e642dc2a8cd-Paper.pdf

Neural Information Processing Systems | Feb-12-2026, 00:47:56 GMT

formulation, inverse reinforcement, reward function, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

42c8938e4cf5777700700e642dc2a8cd-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-12-2026, 00:47:41 GMT

It also makes no assumptions on the sparseness of the transitions. Our experiments reflect this as well, as the transition probabilities are drawn from a uniform distribution with no sparseness assumptions and would be more difficult than sparse cases.

artificial intelligence, inverse reinforcement learning, machine learning, (2 more...)

Neural Information Processing Systems

Country: North America > United States > California > Santa Clara County > Palo Alto (0.06)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.34)

STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

Neural Information Processing Systems | Feb-12-2026, 00:38:39 GMT

Recently, model-based reinforcement learning algorithms have demonstrated remarkable efficacy in visual input environments.

large language model, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > China > Chongqing Province > Chongqing (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.93)

Industry: Leisure & Entertainment > Games > Computer Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

cf5a019ae9c11b4be88213ce3f85d85c-Paper-Conference.pdf

Neural Information Processing Systems | Feb-12-2026, 00:23:17 GMT

Here, we focus on a more practical setting in object rearrangement, i.e., rearranging objects from shuffled layouts to a normative target distribution without explicit goal specification. However, it remains challenging for AI agents, as it is hard to describe the target distribution (goal specification) for reward engineering or collect expert trajectories as demonstrations. Hence, it is infeasible to directly employ reinforcement learning or imitation learning algorithms to address the task. This paper aims to search for a policy only with a set of examples from a target distribution instead of a handcrafted reward function. We employ the score-matching objective to train a Target Gradient Field (TarGF), indicating a direction on each object to increase the likelihood of the target distribution.

machine learning, reinforcement learning, sac, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report (0.46)

Technology: