KALM: Knowledgeable Agent by Offline Reinforcement Learning from Large Language Model Rollouts

Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Xiong-Hui Chen, Nan Tang, Yang Yu

Neural Information Processing Systems 

Reinforcement learning (RL) traditionally trains agents using interaction data, which limits their capabilities to the scope of the training data.
