Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions

Mar-20-2026, 02:50:22 GMT–Neural Information Processing Systems

Real-world offline datasets are often subject to data corruptions (such as noise or adversarial attacks) due to sensor failures or malicious attacks. Despite advances in robust offline reinforcement learning (RL), existing methods struggle to learn robust agents under high uncertainty caused by the diverse corrupted data (i.e., corrupted states, actions, rewards, and dynamics), leading to performance degradation in clean environments. To tackle this problem, we propose a novel robust variational Bayesian inference for offline RL (TRACER). It introduces Bayesian inference for the first time to capture the uncertainty via offline data for robustness against all types of data corruptions.

artificial intelligence, machine learning, reinforcement learning, (10 more...)

Neural Information Processing Systems

Mar-20-2026, 02:50:22 GMT

Conferences Web Page

Add feedback

Industry:
- Information Technology > Security & Privacy (0.59)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.62)