Model-Free Active Exploration in Reinforcement Learning
–Neural Information Processing Systems
We study the problem of exploration in Reinforcement Learning and present a novel model-free solution. We adopt an information-theoretical viewpoint and start from the instance-specific lower bound of the number of samples that have to be collected to identify a nearly-optimal policy.
Neural Information Processing Systems
Oct-9-2025, 04:24:54 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- France > Auvergne-Rhône-Alpes
- Netherlands
- North Holland > Amsterdam (0.04)
- South Holland > Delft (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- Greater London > London (0.04)
- Oxfordshire > Oxford (0.04)
- North America > United States
- Texas > Travis County > Austin (0.04)
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.93)
- Technology: