Adaptive Data Exploitation in Deep Reinforcement Learning

Mingqi Yuan, Bo Li, Xin Jin, Wenjun Zeng

Jan-21-2025–arXiv.org Artificial Intelligence

We introduce ADEPT (Adaptive Data ExPloiTation), a simple yet powerful framework for improving **data efficiency** and **generalization** in deep reinforcement learning (RL). Specifically, ADEPT uses multi-armed bandit (MAB) algorithms to adaptively manage how sampled data is used across different learning stages, optimizing data utilization while mitigating overfitting. Moreover, ADEPT can significantly reduce computational overhead and accelerate a wide range of RL algorithms. We test ADEPT on benchmarks including Procgen, MiniGrid, and PyBullet. Extensive simulations demonstrate that ADEPT can achieve superior performance with remarkable computational efficiency, offering a practical solution to data-efficient RL. Our code is available at https://github.com/yuanmingqi/ADEPT.
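The bandit idea in the abstract can be illustrated with a minimal sketch; this is not the authors' implementation. It assumes the arms are candidate numbers of update epochs per batch of sampled data (the abstract does not say what the arms are), uses UCB1 as one common MAB algorithm, and stands in for the real training signal with a hypothetical placeholder, `hypothetical_feedback`. The class name `UCBBandit` and all parameter values are likewise illustrative.

```python
import math


class UCBBandit:
    """UCB1 bandit over a small set of discrete arms (illustrative only)."""

    def __init__(self, arms, c=2.0):
        self.arms = list(arms)                 # e.g. candidate update-epoch counts (assumption)
        self.c = c                             # exploration coefficient
        self.counts = [0] * len(self.arms)     # number of pulls per arm
        self.values = [0.0] * len(self.arms)   # running mean reward per arm
        self.t = 0                             # total number of selections so far

    def select(self):
        """Return the index of the arm to play next."""
        self.t += 1
        # Play every arm once before applying the UCB rule.
        for i, n in enumerate(self.counts):
            if n == 0:
                return i
        scores = [
            v + math.sqrt(self.c * math.log(self.t) / n)
            for v, n in zip(self.values, self.counts)
        ]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, i, reward):
        """Fold a new reward into the running mean of arm i."""
        self.counts[i] += 1
        self.values[i] += (reward - self.values[i]) / self.counts[i]


def hypothetical_feedback(epochs):
    """Placeholder reward; a real agent would derive this from training statistics."""
    return {1: 0.2, 2: 0.8, 3: 0.5}[epochs]


bandit = UCBBandit(arms=[1, 2, 3])
for _ in range(200):
    i = bandit.select()
    epochs = bandit.arms[i]
    # A real training loop would run `epochs` passes over the sampled data here.
    bandit.update(i, hypothetical_feedback(epochs))

# The arm pulled most often is the bandit's preferred setting.
best = bandit.arms[max(range(len(bandit.arms)), key=bandit.counts.__getitem__)]
```

In this toy setup the bandit concentrates its pulls on the arm with the highest placeholder reward while still occasionally revisiting the others, which is the mechanism that would let the choice shift between learning stages if the feedback changed over time.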

environment step, machine learning, reinforcement learning


Country:
- North America > United States
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
- Europe > Portugal
  - Braga > Braga (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - China
    - Zhejiang Province > Ningbo (0.04)
    - Hong Kong (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Energy (0.67)
- Education (0.46)
- Leisure & Entertainment > Games (0.45)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.68)
