[P] Landing the Falcon booster with Reinforcement Learning in OpenAI • r/MachineLearning

Feb-17-2018, 13:40:40 GMT–#artificialintelligence

There has been a discussion recently about using RL to land a SpaceX booster. Coincidentally I've been working on exactly this in OpenAI. It was as much fun as it was frustrating at times. It's trained with a PPO implementation from Unity that I've changed to work with OpenAI (GitHub). The official OpenAI implementation is convoluted and impossible to work with in my opinion. This particular agent took 200'000 tries over the course of 12 hours and 20 million frames (with a frame skip value of 5, so 100 million total frames).

large language model, machine learning, reinforcement learning, (9 more...)

#artificialintelligence

Feb-17-2018, 13:40:40 GMT

News Web Page

Add feedback

Industry:
- Media > News (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found