Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Neural Information Processing Systems
We provide both theoretical analysis and experimental results to validate the effectiveness of our proposed algorithm.
Feb-16-2026, 02:51:20 GMT