Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation

Aug-14-2025, 11:44:55 GMT–Neural Information Processing Systems

In reinforcement learning (RL), an agent interacts with a stochastic environment in order to maximize the total reward [Sutton and Barto, 2018].

algorithm, approximation, function approximation, (12 more...)

Neural Information Processing Systems

Aug-14-2025, 11:44:55 GMT

Conferences PDF

Country:
- North America
  - Canada > Alberta (0.14)
  - United States
    - Utah > Salt Lake County
      - Salt Lake City (0.04)
    - New York > Erie County
      - Buffalo (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
      - Belmont (0.04)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Uncertainty
    - Fuzzy Logic (0.45)

Duplicate Docs Excel Report

Title
Non-AsymptoticAnalysisforTwoTime-scaleTDC withGeneralSmoothFunctionApproximation

Similar Docs Excel Report more

Title	Similarity	Source
None found