Diversify \& Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement

Dec-26-2025, 12:23:06 GMT–Neural Information Processing Systems

Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards.

diversify & conquer, out-of-distribution disagreement, outcome-directed curriculum rl, (6 more...)

Neural Information Processing Systems

Dec-26-2025, 12:23:06 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.57)