Regret Minimization in MDPs with Options without Prior Knowledge

Oct-3-2024, 22:20:59 GMT–Neural Information Processing Systems

Recent works leveraged the mapping of Markov decision processes (MDPs) with options to semi-MDPs (SMDPs) and introduced SMDP-versions of exploration-exploitation algorithms (e.g.,

algorithm, confidence interval, temporal abstraction

Country:
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - Florida > Broward County
    - Fort Lauderdale (0.04)
  - California
    - San Francisco County > San Francisco (0.14)
    - Santa Clara County > Palo Alto (0.04)
    - Los Angeles County > Long Beach (0.04)
- Europe > Finland
  - Uusimaa > Helsinki (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (0.68)
