Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation
–Neural Information Processing Systems
Reinforcement learning (RL) is a sequential decision-making problem in which an agent tries to maximize its expected cumulative reward by interacting with an unknown environment over time.
Neural Information Processing Systems
Feb-16-2026, 12:57:10 GMT
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia
- Middle East > Jordan (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Europe > Austria (0.04)
- North America > United States (0.04)
- Africa > Ethiopia
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Leisure & Entertainment (0.45)
- Technology: