Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback

Oct-10-2025, 21:24:49 GMT–Neural Information Processing Systems

Previous work has investigated this problem under either full-information loss feedback with unknown transitions (Zhao et al., 2024), or bandit

algorithm, low-rank MDP, probability

Country:
- North America > United States
  - Virginia (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > Experimental Study (0.92)

Technology:
- Information Technology
  - Data Science (0.67)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Learning Graphical Models
      - Undirected Networks > Markov Models (0.46)
