tensorflow/agents

Sep-8-2017, 23:46:27 GMT–@machinelearnbot

This project provides optimized infrastructure for reinforcement learning. It extends the OpenAI Gym interface to multiple parallel environments and allows agents to be implemented in TensorFlow and to perform batched computation. As a starting point, we provide BatchPPO, an optimized implementation of Proximal Policy Optimization. The algorithm to use is defined in the configuration; the pendulum configuration, for example, uses the included PPO implementation. More predefined configurations are available in agents/scripts/configs.py.
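To make the idea of extending a gym-style interface to several environments more concrete, here is a minimal sketch in plain Python. It is not the project's implementation: the class names CountdownEnv and SimpleBatchEnv are hypothetical, the environments are stepped one after another rather than in parallel processes, and the results are plain lists rather than TensorFlow tensors. It only illustrates the shape of a batched reset/step call, with one action going in and one observation coming out per environment.

```python
class CountdownEnv:
    """Hypothetical toy environment with a gym-style reset/step interface."""

    def __init__(self, start):
        self._start = start
        self._state = start

    def reset(self):
        self._state = self._start
        return self._state

    def step(self, action):
        # Subtract the action from the state; the episode ends at zero or below.
        self._state -= action
        done = self._state <= 0
        reward = 1.0 if done else 0.0
        return self._state, reward, done, {}


class SimpleBatchEnv:
    """Hypothetical wrapper that steps several environments with one call."""

    def __init__(self, envs):
        self._envs = list(envs)

    def __len__(self):
        return len(self._envs)

    def reset(self):
        # One observation per wrapped environment.
        return [env.reset() for env in self._envs]

    def step(self, actions):
        if len(actions) != len(self._envs):
            raise ValueError("need exactly one action per environment")
        transitions = [env.step(a) for env, a in zip(self._envs, actions)]
        # Regroup per-environment tuples into per-field batches.
        observs, rewards, dones, infos = (list(x) for x in zip(*transitions))
        return observs, rewards, dones, infos


batch = SimpleBatchEnv([CountdownEnv(3), CountdownEnv(5)])
observs = batch.reset()
observs, rewards, dones, infos = batch.step([3, 2])
print(observs, rewards, dones)
```

Grouping the results by field rather than by environment is what lets an agent process all observations in a single batched computation instead of looping over environments itself.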

