[P] Landing the Falcon booster with Reinforcement Learning in OpenAI • r/MachineLearning
There has been a discussion recently about using RL to land a SpaceX booster. Coincidentally I've been working on exactly this in OpenAI. It was as much fun as it was frustrating at times. It's trained with a PPO implementation from Unity that I've changed to work with OpenAI (GitHub). The official OpenAI implementation is convoluted and impossible to work with in my opinion. This particular agent took 200'000 tries over the course of 12 hours and 20 million frames (with a frame skip value of 5, so 100 million total frames).
Feb-17-2018, 13:40:40 GMT
- Technology: