Self-Generated In-Context Examples Improve LLMAgents for Sequential Decision-Making Tasks
–Neural Information Processing Systems
Improving Large Language Model (LLM) agents for sequential decision-making tasks typically requires extensive task-specific knowledge engineering--custom prompts, curated examples, and specialized observation/action spaces. We investigate a different approach where agents automatically improve by learning from their own successful experiences without human intervention. Our method constructs and refines a database of self-generated trajectories that serve as in-context examples for future tasks.
Neural Information Processing Systems
Jun-17-2026, 14:42:10 GMT