Bi-directional Recurrence Improves Transformer in Partially Observable Markov Decision Processes

Open in new window