Temporal-Differential Learning in Continuous Environments
arXiv.org Artificial Intelligence
This paper introduces a new reinforcement learning (RL) method, the method of temporal differential. In contrast to the traditional temporal-difference learning method, it plays a crucial role in developing novel RL techniques for continuous environments. In particular, the continuous-time least squares policy evaluation (CT-LSPE) and the continuous-time temporal-differential (CT-TD) learning methods are developed. Both theoretical and empirical evidence is provided to demonstrate the effectiveness of the proposed temporal-differential learning methodology.
Jun-1-2020