Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation Long-Fei Li
–Neural Information Processing Systems
Reinforcement Learning (RL) with function approximation has achieved remarkable success in various applications involving large state and action spaces, such as games [Silver et al., 2016],
Neural Information Processing Systems
Oct-10-2025, 05:14:09 GMT
- Country:
- Asia
- China > Jiangsu Province
- Nanjing (0.04)
- Japan > Honshū
- Kantō
- Chiba Prefecture > Chiba (0.04)
- Tokyo Metropolis Prefecture > Tokyo (0.04)
- Kantō
- Middle East > Jordan (0.04)
- China > Jiangsu Province
- Asia
- Genre:
- Research Report > Experimental Study (1.00)
- Technology: