Optimizing 2D+1 Packing in Constrained Environments Using Deep Reinforcement Learning