Prioritizing Samples in Reinforcement Learning with Reducible Loss

Neural Information Processing Systems 

Most reinforcement learning algorithms take advantage of an experience replay buffer to repeatedly train on samples the agent has observed in the past. However, not all samples are equally significant, and assigning equal importance to each of them is a naive strategy. In this paper, we propose a method to prioritize samples based on how much can be learned from each sample.
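One way to read "how much we can learn from a sample" is as a reducible-loss priority: the gap between the learner's current loss on a transition and the loss of a reference model (e.g. a frozen or target network), so that samples whose error a better model has already eliminated are ranked highest, while irreducibly noisy samples are ranked low. The sketch below illustrates this sampling scheme with toy per-sample losses; the two loss arrays, the clipping floor, and the batch size are illustrative assumptions, not the paper's actual setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy per-sample losses for 8 replay-buffer transitions:
# the learner's current loss vs. a frozen reference model's loss.
learner_loss = rng.uniform(0.0, 2.0, size=8)
reference_loss = rng.uniform(0.0, 2.0, size=8)

# Reducible loss: the part of the learner's loss the reference model has
# already eliminated. Clip at a small floor so samples whose loss is
# irreducible (noise) still have nonzero but negligible priority.
priority = np.clip(learner_loss - reference_loss, a_min=1e-6, a_max=None)

# Draw a minibatch with probability proportional to priority.
probs = priority / priority.sum()
batch = rng.choice(len(priority), size=4, replace=False, p=probs)
print(batch)
```

In practice the priorities would be refreshed as the learner trains, since a sample's reducible loss shrinks once the learner catches up to the reference model on it.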
