Temporal-Differential Learning in Continuous Environments

Jun-1-2020–arXiv.org Artificial Intelligence

In this paper, a new reinforcement learning (RL) method known as the method of temporal differential is introduced. Compared to the traditional temporal-difference learning method, it plays a crucial role in developing novel RL techniques for continuous environments. In particular, the continuous-time least squares policy evaluation (CT-LSPE) and the continuous-time temporal-differential (CT-TD) learning methods are developed. Both theoretical and empirical evidences are provided to demonstrate the effectiveness of the proposed temporal-differential learning methodology.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Jun-1-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Ohio (0.04)
  - District of Columbia > Washington (0.04)
  - New York
    - New York County > New York City (0.04)
    - Kings County > New York City (0.04)
  - New Jersey > Hudson County
    - Hoboken (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe
  - Italy (0.04)
  - United Kingdom > England
    - Greater London > London (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Netherlands > South Holland
    - Dordrecht (0.04)
  - Greece > Central Macedonia
    - Thessaloniki (0.04)

Genre:
- Research Report (1.00)

Industry:
- Government (0.92)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found