Optimizing Deep Reinforcement Learning for American Put Option Hedging