AITopics | Reinforcement Learning

Intuitively, given a behavior generated under one task, Generalized Hindsight returns a different task that the behavior is better suited for. Then, the behavior is relabeled with this new task before being used by an off-policy RL optimizer.

machine learning, reinforcement learning, trajectory, (11 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Industry:

Energy (0.93)
Transportation (0.68)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Towards Robust Bisimulation Metric Learning

Neural Information Processing SystemsOct-2-2025, 23:33:54 GMT

Learned representations in deep reinforcement learning (DRL) have to extract task-relevant information from complex observations, balancing between robustness to distraction and informativeness to the policy.

bisimulation metric, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States (0.46)
North America > Canada > Ontario > Toronto (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.89)

Add feedback

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning

Gregory Farquhar, Shimon Whiteson, Jakob Foerster

Neural Information Processing SystemsOct-2-2025, 23:28:19 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.53)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

Add feedback

569ff987c643b4bedf504efda8f786c2-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 23:27:21 GMT

machine learning, natural language, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Genre:

Overview (0.68)
Research Report (0.46)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Education (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

569ff987c643b4bedf504efda8f786c2-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 23:27:10 GMT

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Learning from Trajectories via Subgoal Discovery

Sujoy Paul, Jeroen Vanbaar, Amit Roy-Chowdhury

Neural Information Processing SystemsOct-2-2025, 23:25:48 GMT

Neural Information Processing Systems http://nips.cc/

machine learning, reinforcement learning, trajectory, (15 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Industry:

Automobiles & Trucks (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

Automatic Curriculum Learning through Value Disagreement

Neural Information Processing SystemsOct-2-2025, 23:23:00 GMT

Continually solving new, unsolved tasks is the key to learning diverse behaviors. Through reinforcement learning (RL), we have made massive strides towards solving tasks that have a single goal.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report (0.46)

Industry: Education (0.93)

Technology: