Structure Matters: Dynamic Policy Gradient

Jun-10-2026, 17:24:54 GMT – Neural Information Processing Systems

In this work, we study $\gamma$-discounted infinite-horizon tabular Markov decision processes (MDPs) and introduce a framework called dynamic policy gradient (DynPG).
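
For readers unfamiliar with the setting, the following is a minimal sketch of the standard $\gamma$-discounted infinite-horizon objective for a tabular MDP. The notation ($\mathcal{S}$, $\mathcal{A}$, $r$, $V^\pi$, $\mu$) is assumed here for illustration and is not taken from the paper; it describes the general problem class only, not the DynPG method itself.

```latex
% Assumed notation: finite state space \mathcal{S}, finite action space \mathcal{A},
% bounded reward r(s,a), discount factor \gamma \in [0,1), policy \pi(a \mid s).
\[
  V^{\pi}(s) \;=\; \mathbb{E}^{\pi}\!\left[\, \sum_{t=0}^{\infty} \gamma^{t}\, r(S_t, A_t) \;\middle|\; S_0 = s \right],
  \qquad s \in \mathcal{S}.
\]
% Given an initial state distribution \mu (also an assumption), the usual goal is
\[
  \max_{\pi} \; \sum_{s \in \mathcal{S}} \mu(s)\, V^{\pi}(s).
\]
```

"Tabular" here means that $\mathcal{S}$ and $\mathcal{A}$ are finite, so a policy can be stored as a table with one entry per state-action pair.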

artificial intelligence, machine learning, proceedings

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.62)