Review for NeurIPS paper: RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning

Neural Information Processing Systems 

Weaknesses: My biggest concern is that most of the datasets seem homogenous in terms of data collection sources. Most seem to consist of experience collected from a handful of RL algorithm runs. In real world settings, data collection could take place from heterogenous sources of data, such as humans. In that regard, it seems prudent to keep the task domains fixed and provide datasets that vary the quality of dataset sources along different dimensions (e.g. Data collection through humans could also be considered, as done in prior works like this one (https://arxiv.org/abs/1811.02790) or this one (https://arxiv.org/abs/1909.12200).