Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
–Neural Information Processing Systems
Reinforcement Learning (RL) agents often learn policies that do not generalise across tasks in which the environmental features and optimal skills are different [des Combes et al., 2018, Garcin et al., 2024].
Neural Information Processing Systems
Oct-10-2025, 16:23:37 GMT
- Country:
- Asia > China
- Heilongjiang Province > Harbin (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > China
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.93)
- Research Report
- Industry:
- Education (0.92)
- Technology: