Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes Supplementary Materials

Neural Information Processing Systems 

All of these tasks are benchmarks used to test models with long-term memory. It is similar to the previous one. Each image is flattened into a one-dimensional vector. The agent can only move forward along the corridor. The agent receives a reward only at the end of the episode.
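
The two mechanics mentioned above (flattening an image into a vector, and a forward-only corridor with a reward at the end of the episode) can be illustrated with a minimal sketch. This is not the authors' environment or code: the names `flatten_image`, `ForwardCorridor`, and `run_episode`, the corridor length, and the reward value are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


def flatten_image(image):
    """Flatten a 2-D image (a list of rows) into a 1-D list, row by row."""
    return [pixel for row in image for pixel in row]


@dataclass
class ForwardCorridor:
    """Toy corridor (hypothetical): the only action is one step forward,
    and a non-zero reward is given only on the final step."""
    length: int = 5            # assumed corridor length, not from the paper
    final_reward: float = 1.0  # assumed terminal reward, not from the paper
    position: int = 0

    def reset(self):
        """Put the agent back at the start of the corridor."""
        self.position = 0
        return self.position

    def step(self):
        """Advance one cell and return (position, reward, done)."""
        if self.position >= self.length:
            raise RuntimeError("episode is over; call reset()")
        self.position += 1
        done = self.position == self.length
        reward = self.final_reward if done else 0.0
        return self.position, reward, done


def run_episode(env):
    """Walk the whole corridor and collect the reward seen at each step."""
    env.reset()
    rewards = []
    done = False
    while not done:
        _, reward, done = env.step()
        rewards.append(reward)
    return rewards


if __name__ == "__main__":
    print(flatten_image([[1, 2], [3, 4]]))
    print(run_episode(ForwardCorridor(length=3)))
```

Because every step before the last returns zero reward, an agent in such a setting cannot rely on intermediate feedback and has to carry information to the end of the episode, which is why tasks of this shape are used as long-term memory benchmarks.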
