Scheduling Distributed Flexible Assembly Lines using Safe Reinforcement Learning with Soft Shielding