Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL Qi Lv

Neural Information Processing Systems 

Thus, this kind of fine-grained intrinsic connection among RSAs is intuitively beneficial for policy learning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found