Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems

Dec-27-2025, 15:54:12 GMT–Neural Information Processing Systems

We study Reinforcement Learning for partially observable dynamical systems using function approximation. We propose a new Partially Observable Bilinear Actor-Critic framework, that is general enough to include models such as observable tabular Partially Observable Markov Decision Processes (POMDPs), observable Linear-Quadratic-Gaussian (LQG), Predictive State Representations (PSRs), as well as a newly introduced model Hilbert Space Embeddings of POMDPs and observable POMDPs with latent low-rank transition.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Dec-27-2025, 15:54:12 GMT

Conferences PDF

Add feedback

Country:
- Asia
  - Japan > Honshū
    - Kantō > Kanagawa Prefecture (0.04)
  - Middle East > Jordan (0.04)
- Europe
  - Germany > Hesse
    - Darmstadt Region > Darmstadt (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Learning Graphical Models > Undirected Networks
    - Markov Models (1.00)
  - Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
03d7e13f0092405804f3a381ade8f3f0-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found