RL: Efficient Exploration for Nonepisodic RL
–Neural Information Processing Systems
We study the problem of nonepisodic reinforcement learning (RL) for nonlinear dynamical systems, where the system dynamics are unknown and the RL agent has to learn from a single trajectory, i.e., adapt online and without resets. This setting is ubiquitous in the real world, where resetting is impossible or requires human intervention.
Neural Information Processing Systems
May-25-2025, 08:21:49 GMT
- Country:
- Europe > Switzerland > Zürich > Zürich (0.14)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Energy (0.47)
- Technology: