[P] Landing the Falcon booster with Reinforcement Learning in OpenAI • r/MachineLearning

#artificialintelligence 

There has been a discussion recently about using RL to land a SpaceX booster. Coincidentally I've been working on exactly this in OpenAI. It was as much fun as it was frustrating at times. It's trained with a PPO implementation from Unity that I've changed to work with OpenAI (GitHub). The official OpenAI implementation is convoluted and impossible to work with in my opinion. This particular agent took 200'000 tries over the course of 12 hours and 20 million frames (with a frame skip value of 5, so 100 million total frames).

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found