Neural Information Processing Systems 

Relabeling methods typically pose the question: if, in hindsight, we assume that our experience was optimal for some task, for what task was it optimal?