Simple random search of static linear policies is competitive for reinforcement learning

Open in new window