Learning Memory Mechanisms for Decision Making through Demonstrations
Yue, William, Liu, Bo, Stone, Peter
–arXiv.org Artificial Intelligence
In Partially Observable Markov Decision Processes, integrating an agent's history into memory poses a significant challenge for decision-making. Traditional imitation learning, relying on observation-action pairs for expert demonstrations, fails to capture the expert's memory mechanisms used in decision-making. To capture memory processes as demonstrations, we introduce the concept of memory dependency pairs $(p, q)$ indicating that events at time $p$ are recalled for decision-making at time $q$. We introduce AttentionTuner to leverage memory dependency pairs in Transformers and find significant improvements across several tasks compared to standard Transformers when evaluated on Memory Gym and the Long-term Memory Benchmark. Code is available at https://github.com/WilliamYue37/AttentionTuner.
arXiv.org Artificial Intelligence
Nov-12-2024
- Country:
- North America > United States (0.68)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (0.98)
- Technology: