Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL Yang Yue

Neural Information Processing Systems 

We also give unique insights into its effectiveness.