SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning

Feb-17-2026, 04:23:54 GMT–Neural Information Processing Systems

In order to overcome overestimation bias, ensemble methods for Q-learning have been investigated to exploit the diversity of multiple Q-functions. Since network initialization has been the predominant approach to promote diversity in Q-functions, heuristically designed diversity injection methods have been studied in the literature. However, previous studies have not attempted to approach guaranteed independence over an ensemble from a theoretical perspective.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Feb-17-2026, 04:23:54 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - California > Alameda County > Berkeley (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > South Korea
  - Seoul > Seoul (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks (0.93)

Duplicate Docs Excel Report

Title
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning

Similar Docs Excel Report more

Title	Similarity	Source
None found