Simple Reinforcement Learning with Tensorflow: Part 2 - Policy-based Agents

Apr-8-2018, 01:12:56 GMT–#artificialintelligence

After a weeklong break, I am back again with part 2 of my Reinforcement Learning tutorial series. In Part 1, I had shown how to put together a basic agent that learns to choose the more rewarding of two possible options. In this post, I am going to describe how we get from that simple agent to one that is capable of taking in an observation of the world, and taking actions which provide the optimal reward not just in the present, but over the long run. With these additions, we will have a full reinforcement agent. Environments which pose the full problem to an agent are referred to as Markov Decision Processes (MDPs).

agent, policy-based agent, simple reinforcement learning, (7 more...)

#artificialintelligence

Apr-8-2018, 01:12:56 GMT

News Web Page

Add feedback

Genre:
- Instructional Material > Course Syllabus & Notes (0.57)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.75)
  - Representation & Reasoning > Agents (0.56)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found