Faster Deep Reinforcement Learning with Slower Online Network

Nov-15-2025, 06:23:18 GMT–Neural Information Processing Systems

Deep reinforcement learning algorithms often use two networks for value function optimization: an online network, and a target network that tracks the online network with some delay. Using two separate networks enables the agent to hedge against issues that arise when performing bootstrapping.
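The two-network arrangement described above can be sketched in a few lines. This is only a generic illustration of a delayed target network, not the method proposed in the paper: the linear Q-function, the synthetic transitions, and the values of `gamma`, `lr`, and `sync_every` are all assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

n_features, n_actions = 4, 2
gamma = 0.99       # discount factor (assumed value)
lr = 0.05          # step size (assumed value)
sync_every = 10    # target refresh period in steps (assumed value)

# Linear Q-function for illustration: Q(s, a) = s @ W[:, a]
online_w = rng.normal(scale=0.1, size=(n_features, n_actions))
target_w = online_w.copy()  # the target starts as a copy of the online weights


def td_target(reward, next_state, done, target_w, gamma):
    """Bootstrapped target computed from the target weights only."""
    next_q = next_state @ target_w
    return reward + (1.0 - done) * gamma * next_q.max()


def online_step(online_w, target_w, transition, lr, gamma):
    """One semi-gradient update of the online weights; target_w is read-only."""
    state, action, reward, next_state, done = transition
    y = td_target(reward, next_state, done, target_w, gamma)
    td_error = y - state @ online_w[:, action]
    new_w = online_w.copy()
    new_w[:, action] += lr * td_error * state
    return new_w


sync_steps = []
for step in range(1, 31):
    # Synthetic transition standing in for a real environment interaction.
    transition = (
        rng.normal(size=n_features),   # state
        int(rng.integers(n_actions)),  # action
        float(rng.normal()),           # reward
        rng.normal(size=n_features),   # next state
        0.0,                           # done flag
    )
    online_w = online_step(online_w, target_w, transition, lr, gamma)
    if step % sync_every == 0:
        target_w = online_w.copy()     # delayed hard update of the target
        sync_steps.append(step)
```

Between refreshes the bootstrapped target depends only on `target_w`, so it stays fixed while the online weights move. That separation is what the delay provides in this sketch.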

algorithm, learning, proximal term

Country:
- Europe
  - Russia (0.04)
  - France (0.04)
- Asia
  - Russia (0.04)
  - Middle East > Jordan (0.04)
  - South Korea > Seoul
    - Seoul (0.04)

Industry:
- Leisure & Entertainment > Games (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
