Reinforcement Learning-based Product Delivery Frequency Control