Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments