Structure Matters: Dynamic Policy Gradient

Jun-15-2026, 04:24:40 GMT–Neural Information Processing Systems

In this work, we study γ-discounted infinite-horizon tabular Markov decision processes (MDPs) and introduce a framework called dynamic policy gradient (DynPG).

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Jun-15-2026, 04:24:40 GMT

Conferences PDF

Country:
- Europe (0.45)
- North America > United States
  - Illinois (0.28)

Genre:
- Research Report > Experimental Study (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.69)
  - Machine Learning
    - Statistical Learning (0.95)
    - Reinforcement Learning (0.93)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found