5d2c2cee8ab0b9a36bd1ed7196bd6c4a-Paper.pdf
–Neural Information Processing Systems
We study theregretincurred bytheagent, firstwhen sheknowsherrewardfunction but does not know the distribution of the task duration, and then when she does not knowher reward function, either.
Neural Information Processing Systems
Feb-8-2026, 21:45:08 GMT
- Country:
- Europe > France
- Île-de-France > Paris > Paris (0.04)
- North America > United States
- California (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Europe > France
- Technology: