MetaCURL: Non-stationary Concave Utility Reinforcement Learning Bianca Marin Moreno Inria
–Neural Information Processing Systems
We explore online learning in episodic Markov decision processes on non-stationary environments (changing losses and probability transitions).
Neural Information Processing Systems
Oct-10-2025, 19:02:14 GMT
- Country:
- Europe
- France
- Auvergne-Rhône-Alpes > Isère
- Grenoble (0.04)
- Île-de-France > Paris
- Paris (0.04)
- Auvergne-Rhône-Alpes > Isère
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- France
- Europe
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education > Educational Setting
- Online (0.48)
- Energy > Power Industry (0.45)
- Education > Educational Setting