Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations
–Neural Information Processing Systems
In real-world reinforcement learning (RL) systems, various forms of {\it impaired observability} can complicate matters. These situations arise when an agent is unable to observe the most recent state of the system due to latency or lossy channels, yet the agent must still make real-time decisions. This paper introduces a theoretical investigation into efficient RL in control systems where agents must act with delayed and missing state observations.
Neural Information Processing Systems
Dec-26-2025, 08:29:19 GMT
- Technology: