Reviews: Simple random search of static linear policies is competitive for reinforcement learning

Oct-7-2024, 14:11:46 GMT–Neural Information Processing Systems

The main idea is to demonstrate the effectiveness of these simple algorithms compared to the much more complex state-of-the-art RL algorithms proposed and evaluated on MuJoCo tasks. The results of the empirical evaluation are startling. The paper convincingly demonstrates very strong performance of the simple algorithm and policy class on the MuJoCo tasks. The evaluation is extremely thorough, the results are compelling and raise serious questions about the current state of RL algorithm evaluation methodology using MuJoCo. In my opinion, this paper is an excellent contribution to the RL literature.

algorithm, evaluation, linear policy, (11 more...)

Neural Information Processing Systems

Oct-7-2024, 14:11:46 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.99)