KALM: Knowledgeable Agent by Offline Reinforcement Learning from Large Language Model Rollouts Jing-Cheng Pang, Si-Hang Y ang, Kaiyuan Li, Xiong-Hui Chen, Nan T ang, Y ang Y u
–Neural Information Processing Systems
Reinforcement learning (RL) traditionally trains agents using interaction data, which limits their capabilities to the scope of the training data.
Neural Information Processing Systems
Oct-10-2025, 19:42:35 GMT
- Country:
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Leisure & Entertainment (0.67)
- Technology: