KALM: Knowledgeable Agent by Offline Reinforcement Learning from Large Language Model Rollouts
Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Xiong-Hui Chen, Nan Tang, Yang Yu

Oct-10-2025, 19:42:35 GMT–Neural Information Processing Systems

Reinforcement learning (RL) traditionally trains agents using interaction data, which limits their capabilities to the scope of the training data.

llm, red ball, rollout

Country:
- Asia > China > Jiangsu Province > Nanjing (0.04)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Leisure & Entertainment (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.93)
