Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data

Oct-9-2025, 06:05:35 GMT–Neural Information Processing Systems

With no switches, i.e., when a fully non-reactive data collection strategy is

algorithm, probability, sparsified mdp, (11 more...)

Neural Information Processing Systems

Oct-9-2025, 06:05:35 GMT

Conferences PDF

Country:
- North America > United States (0.14)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.67)

Industry:
- Health & Medicine (0.31)

Technology:
- Information Technology
  - Data Science > Data Mining (0.93)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
bcdaaa1aec3ae2aa39542acefdec4e4b-Supplemental-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found