Learning in PyTorch Modern Reinforcement Learning: Deep Q