AITopics | Reinforcement Learning

OfflineReinforcementLearningwithDifferential Privacy

Neural Information Processing SystemsFeb-16-2026, 22:28:00 GMT

Since offline RL does not require access to the environment, it can be applied to problems where interaction with environment is infeasible,e.g., when collecting new data is costly (trade or finance [Zhang et al., 2020]), risky (autonomous driving [Sallab et al., 2017]) or illegal / unethical (healthcare [Raghu etal.,2017]).

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Industry:

Health & Medicine (0.48)
Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

c056d6cf7b7108418f2b8c307dfaab02-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 22:07:39 GMT

Further, we are interested in algorithms that enjoy sample efficiency while leveraging (value) function approximation.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland > Baltimore (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

Add feedback

bf89c9fcd0ef605571a03666f6a6a44d-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 21:44:27 GMT

Furthermore, the theoretical guarantees of the transitivity and aggregation error bound are justified.

machine learning, reinforcement learning, state abstraction, (19 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.68)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)
Information Technology > Artificial Intelligence > Robots (0.68)

Add feedback

99d7a578d72ed133203d1f88c9d39044-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 21:43:40 GMT

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Oregon > Lane County > Eugene (0.14)
Asia > Singapore (0.04)
North America > United States > Ohio > Lucas County > Oregon (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

bf665e1cf271faa5037374c884ba3808-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 21:43:08 GMT

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
North America > Canada > Ontario (0.04)
Asia > China > Hong Kong (0.04)
(2 more...)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)

Add feedback

Dynamic Regret of Adversarial Linear Mixture MDPs

Neural Information Processing SystemsFeb-16-2026, 21:18:10 GMT

We study reinforcement learning in episodic inhomogeneous MDPs with adversarial full-information rewards and the unknown transition kernel. We consider the linear mixture MDPs whose transition kernel is a linear mixture model and choose the dynamic regret as the performance measure.

dynamic regret, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: