Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
Neural Information Processing Systems
Learning the minimum/maximum mean among a finite set of distributions is a fundamental sub-task in planning, game tree search and reinforcement learning.
Nov-20-2025, 17:38:57 GMT
- Country:
- Europe > Netherlands
- North Holland > Amsterdam (0.04)
- North America > Canada
- Industry:
- Leisure & Entertainment > Games (0.68)
- Technology: