LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search
–Neural Information Processing Systems
The integration of evolutionary algorithms (EAs) with reinforcement learning (RL) has shown superior performance compared to standalone methods. However, previous research focuses on exploration in policy parameter space, while overlooking the reward function search. To bridge this gap, we propose LaRes, a novel hybrid framework that achieves efficient policy learning through reward function search. LaRes leverages large language models (LLMs) to generate the reward function population, guiding RL in policy learning. The reward functions are evaluated by the policy performance and improved through LLMs.
Neural Information Processing Systems
Jun-15-2026, 14:47:03 GMT
- Country:
- Asia (0.28)
- Genre:
- Research Report > Experimental Study (1.00)
- Overview (1.00)
- Industry:
- Education (0.46)
- Technology: