ProvableModel-based NonlinearBanditand ReinforcementLearning: ShelveOptimism,Embrace VirtualCurvature
–Neural Information Processing Systems
A key algorithmic insight is that optimism may lead to over-exploration even for two-layer neural net model class.
Neural Information Processing Systems
Feb-11-2026, 11:36:04 GMT
- Country:
- North America > United States
- Washington > King County
- Seattle (0.04)
- Massachusetts > Suffolk County
- Chelsea (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Washington > King County
- Asia > Middle East
- Jordan (0.05)
- North America > United States
- Technology: