An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context
–Neural Information Processing Systems
One of the key challenges in deploying RL to real-world applications is to adapt to variations of unknown environment contexts, such as changing terrains in robotic tasks and fluctuated bandwidth in congestion control. Existing works on adaptation to unknown environment contexts either assume the contexts are the same for the whole episode or assume the context variables are Markovian. However, in many real-world applications, the environment context usually stays stable for a stochastic period and then changes in an abrupt and unpredictable manner within an episode, resulting in a segment structure, which existing works fail to address.
Neural Information Processing Systems
Dec-25-2025, 14:06:21 GMT
- Technology: