Deep Reinforcement Learning for Dynamic Order Picking in Warehouse Operations