Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations