Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies