Policy Improvement via Imitation of Multiple Oracles
–Neural Information Processing Systems
Despite its promise, reinforcement learning's real-world adoption has been hampered by the need for costly exploration to learn a good policy.
Neural Information Processing Systems
Oct-2-2025, 17:37:15 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America
- Canada (0.04)
- United States > Massachusetts
- Middlesex County > Belmont (0.04)
- Asia > Middle East
- Technology: