Review for NeurIPS paper: RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning

Jan-24-2025, 09:01:10 GMT–Neural Information Processing Systems

Weaknesses: My biggest concern is that most of the datasets seem homogenous in terms of data collection sources. Most seem to consist of experience collected from a handful of RL algorithm runs. In real world settings, data collection could take place from heterogenous sources of data, such as humans. In that regard, it seems prudent to keep the task domains fixed and provide datasets that vary the quality of dataset sources along different dimensions (e.g. Data collection through humans could also be considered, as done in prior works like this one (https://arxiv.org/abs/1811.02790) or this one (https://arxiv.org/abs/1909.12200).

benchmark, dataset, offline reinforcement learning, (11 more...)

Neural Information Processing Systems

Jan-24-2025, 09:01:10 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)