Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis

Aug-15-2025, 16:32:31 GMT–Neural Information Processing Systems

Recently, several works have proposed applying variance reduction techniques developed in the stochastic optimization literature to reduce the variance of TD learning.

algorithm, convergence error, vrtdc

Country:
- North America
  - Canada > Alberta (0.14)
  - United States
    - Utah > Salt Lake County
      - Salt Lake City (0.04)
    - New York > Erie County
      - Buffalo (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Statistical Learning (0.88)
