Neural Information Processing Systems
RL that utilize function approximation to generalize observational data to unknown states/actions. The goal of this paper is to study the sample complexity of policy-based RL, which is arguably the simplest setting for RL with function approximation (Kearns et al., 1999; Kakade, 2003).
Feb-12-2026, 03:11:03 GMT
- Country:
  - Asia > Middle East
    - Jordan (0.04)
  - Europe > United Kingdom
    - England
      - Cambridgeshire > Cambridge (0.04)
      - Greater London > London (0.04)
  - North America > United States
    - Washington > King County > Seattle (0.04)
- Technology: