Regret Minimization in MDPs with Options without Prior Knowledge
–Neural Information Processing Systems
Recent works leveraged the mapping of Markov decision processes (MDPs) with options to semi-MDPs (SMDPs) and introduced SMDP-versions of exploration-exploitation algorithms (e.g.,
Neural Information Processing Systems
Oct-3-2024, 22:20:59 GMT
- Country:
- North America > United States
- New York > New York County
- New York City (0.04)
- Florida > Broward County
- Fort Lauderdale (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > Palo Alto (0.04)
- Los Angeles County > Long Beach (0.04)
- New York > New York County
- Europe > Finland
- North America > United States