Prioritizing Samples in Reinforcement Learning with Reducible Loss
–Neural Information Processing Systems
Most reinforcement learning algorithms take advantage of an experience replay buffer to repeatedly train on samples the agent has observed in the past. Not all samples carry the same amount of significance and simply assigning equal importance to each of the samples is a naive strategy. In this paper, we propose a method to prioritize samples based on how much we can learn from a sample.
Neural Information Processing Systems
Feb-11-2026, 08:36:46 GMT
- Country:
- South America > Brazil
- Pernambuco (0.04)
- North America
- United States
- Illinois > Cook County
- Chicago (0.04)
- Arizona > Maricopa County
- Phoenix (0.04)
- Illinois > Cook County
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Vancouver (0.04)
- United States
- Europe
- South America > Brazil
- Genre:
- Research Report > New Finding (0.46)
- Technology: