AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction, Section 1.1. MIT Press, Cambridge, MA, 1998.

3f20f2b0315c72201e23512fdbd1ee91-Paper-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 13:00:36 GMT

constraint, gradient integration method, target distribution, (13 more...)

Country:

Asia > South Korea > Seoul > Seoul (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

3f1b6e97a5eb3b10e6b0c99b022988eb-Paper-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 13:00:29 GMT

information retrieval, machine learning, reinforcement learning, (16 more...)

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Oregon (0.04)
(2 more...)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

3da8e709fa1a7d9e23bee89d3c25b5b4-Supplemental-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 12:47:21 GMT

approximation, equation, relative value function, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)

Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms

Neural Information Processing Systems, Oct-8-2025, 12:47:17 GMT

Reinforcement Learning algorithms can be broadly classified into value-based methods and policy-based methods.

algorithm, iteration, policy iteration, (15 more...)

Country:

North America > United States > Illinois > Champaign County > Urbana (0.14)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Mutual Information Regularized Offline Reinforcement Learning

Neural Information Processing Systems, Oct-8-2025, 12:30:26 GMT

We show that optimizing this lower bound is equivalent to maximizing the likelihood of a one-step improved policy on the offline dataset. Hence, we constrain the policy improvement direction to lie in the data manifold.

estimation, misa, mutual information, (10 more...)

Country: Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Neural Information Processing Systems, Oct-8-2025, 12:29:52 GMT

Artificial intelligence (AI) systems possess significant potential to drive societal progress. However, their deployment often faces obstacles due to substantial safety concerns.

agent, algorithm, hinge torque, (12 more...)

Country: Asia > China > Beijing > Beijing (0.04)

Industry: Automobiles & Trucks (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Inverse Preference Learning: Preference-based RL without a Reward Function

Neural Information Processing Systems, Oct-8-2025, 12:14:41 GMT

Reward functions are difficult to design and often hard to align with human intent.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

3b3889d313ba9476c12c2d77ea66b24f-Paper-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 12:12:41 GMT

dataset, history length, trajectory, (13 more...)

Country:

North America > United States > Montana (0.04)
North America > United States > California > San Diego County > San Diego (0.04)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.46)

Industry:

Health & Medicine (0.47)
Leisure & Entertainment > Games (0.32)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(3 more...)