Natural Value Approximators: Learning when to Trust Past Estimates
Zhongwen Xu, Joseph Modayil, Hado P. van Hasselt, Andre Barreto, David Silver, Tom Schaul
–Neural Information Processing Systems
Furthermore, as the interpolation is learned and state-dependent, our method can deal with heterogeneous observability.
Neural Information Processing Systems
Nov-21-2025, 14:02:02 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.14)
- North America > United States
- California > Los Angeles County
- Long Beach (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- California > Los Angeles County
- Europe > United Kingdom
- Industry:
- Education (0.46)
- Leisure & Entertainment > Games (0.47)
- Technology: