Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models

Neural Information Processing Systems 

Learning from self-sampled data and sparse environmental feedback remains a fundamental challenge in training self-evolving agents.