AITopics | Reinforcement Learning

Object-oriented representations in reinforcement learning have shown promise in transfer learning, with previous research introducing a propositional objectoriented framework that has provably efficient learning bounds with respect to samplecomplexity.

machine learning, reinforcement learning, transition dynamic, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Africa > South Africa > Gauteng > Pretoria (0.04)
Africa > South Africa > Gauteng > Johannesburg (0.04)

Industry: Transportation > Passenger (0.52)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.36)

Add feedback

Towards Interpretable Reinforcement Learning Using Attention Augmented Agents

Neural Information Processing SystemsFeb-14-2026, 21:54:01 GMT

artificial intelligence, interpretable reinforcement learning, machine learning, (11 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.52)

Add feedback

Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples

Tengyu Xu, Shaofeng Zou, Yingbin Liang

Neural Information Processing SystemsFeb-14-2026, 21:05:28 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, convergence, stepsize, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
North America > United States > Ohio (0.04)

Genre: Research Report > New Finding (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

5e5853f35164e434015716a8c2a66543-Paper-Conference.pdf

Neural Information Processing SystemsFeb-14-2026, 21:04:53 GMT

arxiv preprint arxiv, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Europe > Netherlands (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry:

Leisure & Entertainment > Games (0.68)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
(2 more...)

Add feedback

Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

Su Young Lee, Choi Sungik, Sae-Young Chung

Neural Information Processing SystemsFeb-14-2026, 20:40:46 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, episodic backward update, transition, (12 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > Canada (0.04)

Industry: Leisure & Entertainment (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

e6d8545daa42d5ced125a4bf747b3688-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-14-2026, 20:40:30 GMT

The common specifications in Appendix D are just detailed descriptions of each hyperparameter used in Nature7 DQN paper that we applied to all the baselines and our method for the experiment. Many of the recent reinforcement learning methods require changes in the network structures or require additional20 memory structures (Ephemeral Value Adjustments, RUDDER). The idea of the backward update is not novel and we have stated in section 3.1 that the tabular backward update26 (Algorithm 1) is a special case of Lin's method (1992). The training process of the adaptivescheme is described in Appendix34 A.AlltheKnetworksaretrained using thesame sample episode atthesame time.

artificial intelligence, hyperparameter, reinforcement learning, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.81)

Add feedback

Occam's razor is insufficient to infer the preferences of irrational agents

Stuart Armstrong, Sören Mindermann

Neural Information Processing SystemsFeb-14-2026, 19:57:04 GMT

Toaddressthis, we need simple'normative' assumptions, which cannot be deduced exclusively fromobservations.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > Canada (0.04)
Europe > Netherlands > South Holland > Dordrecht (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

765043fe026f7d704c96cec027f13843-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-14-2026, 19:23:57 GMT

large language model, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Hong Kong (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre:

Overview (0.92)
Research Report > New Finding (0.45)

Industry: Leisure & Entertainment > Games > Computer Games (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
(5 more...)

Add feedback

Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation

Risto Vuorio, Shao-Hua Sun, Hexiang Hu, Joseph J. Lim

Neural Information Processing SystemsFeb-14-2026, 19:23:42 GMT

Neural Information Processing Systems http://nips.cc/

international conference, multimodal task distribution, task distribution, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
North America > United States > Michigan (0.04)
North America > Canada (0.04)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

Exploration in Structured Reinforcement Learning

Jungseul Ok, Alexandre Proutiere, Damianos Tranos

Neural Information Processing SystemsFeb-14-2026, 19:22:48 GMT

Hence, with largestate and action spaces, it is essential to identify and exploit any possible structure existing in the system dynamics and reward function so as to minimize exploration phases and in turn reduce regret to reasonable values. Modern RL algorithms actually implicitly impose some structural properties either in the model parameters (transition probabilities and reward function, see e.g.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: