Efficient Potential-based Exploration in Reinforcement Learning using Inverse Dynamic Bisimulation Metric
–Neural Information Processing Systems
While a number of RL methods have been proposed to boost exploration by designing an intrinsic reward signal as exploration bonus.
Neural Information Processing Systems
Feb-15-2026, 05:07:01 GMT
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia
- China
- Hong Kong (0.04)
- Zhejiang Province > Hangzhou (0.04)
- Macao (0.14)
- China
- Africa > Ethiopia
- Industry:
- Leisure & Entertainment > Games > Computer Games (0.47)
- Technology: