AITopics | Reinforcement Learning

edb446b67d69adbfe9a21068982000c2-Paper.pdf

Neural Information Processing Systems | Feb-11-2026, 18:57:33 GMT

agent, bellman operator, k-learning, (13 more...)

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

2c3ddf4bf13852db711dd1901fb517fa-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-11-2026, 18:55:27 GMT

As [R1] has pointed out, our novel interpretation of the KL term gives new insights and variations on online Bayesian learning. Since UCL samples the weight parameters only once per iteration, applying it to actor-critic based reinforcement learning algorithms becomes possible.

artificial intelligence, machine learning, reinforcement learning, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.78)

Shaping Belief States with Generative Environment Models for RL

Neural Information Processing Systems | Feb-11-2026, 18:48:21 GMT

agent, arxiv preprint arxiv, generative model, (13 more...)

Country:

North America > Canada (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.99)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (0.70)

Interval timing in deep reinforcement learning agents

Ben Deverett, Ryan Faulkner, Meire Fortunato, Gregory Wayne, Joel Z. Leibo

Neural Information Processing Systems | Feb-11-2026, 18:47:09 GMT

The measurement of time is central to intelligent behavior.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Information Design in Multi-Agent Reinforcement Learning

Neural Information Processing Systems | Feb-11-2026, 18:38:15 GMT

To thrive in those environments, the agent needs to influence other agents so their actions become more helpful and less harmful. Research in computational economics distills two ways to influence others directly: by providing tangible goods (mechanism design) and by providing information (information design). This work investigates information design problems for a group of RL agents. The main challenges are two-fold. One is that the information provided immediately affects the transition of the agent trajectories, which introduces additional non-stationarity. The other is that the information can be ignored, so the sender must provide information that the receiver is willing to respect.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Kosovo > District of Gjilan > Kamenica (0.04)
Asia > China > Hong Kong (0.04)
(4 more...)

Genre: Research Report (0.46)

Industry: Leisure & Entertainment > Games (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.67)

Information Design in Multi-Agent Reinforcement Learning

Neural Information Processing Systems | Feb-11-2026, 18:38:11 GMT

To thrive in those environments, the agent needs to influence other agents so their actions become more helpful and less harmful. Research in computational economics distills two ways to influence others directly: by providing tangible goods (mechanism design) and by providing information (information design). This work investigates information design problems for a group of RL agents. The main challenges are two-fold. One is that the information provided immediately affects the transition of the agent trajectories, which introduces additional non-stationarity. The other is that the information can be ignored, so the sender must provide information that the receiver is willing to respect.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Kosovo > District of Gjilan > Kamenica (0.04)
Asia > China > Hong Kong (0.04)
(4 more...)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

2b8f621e9244cea5007bac8f5d50e476-Paper.pdf

Neural Information Processing Systems | Feb-11-2026, 18:37:55 GMT

arxiv preprint arxiv, demonstration, trajectory, (11 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.99)

51053d7b8473df7d5a2165b2a8ee9629-Paper-Conference.pdf

Neural Information Processing Systems | Feb-11-2026, 18:27:09 GMT

machine learning, natural language, reinforcement learning, (16 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Robust Imitation via Mirror Descent Inverse Reinforcement Learning

Neural Information Processing Systems | Feb-11-2026, 18:16:17 GMT

Inspired by a first-order optimization method called mirror descent, this paper proposes to predict a sequence of reward functions, which are iterative solutions for a constrained convex problem. IRL solutions derived by mirror descent are tolerant to the uncertainty incurred by target density estimation since the amount of reward learning is regulated with respect to local geometric constraints.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Optimizing Data Collection for Machine Learning

Neural Information Processing Systems | Feb-11-2026, 17:47:43 GMT

For each of the Dk subsets, respectively, we follow the same subsampling procedure used in the single-variate case. That is, we let q10 = 10% of the first data subset and q20 = 10% of the second data subset.

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)
