Automated Proof of Polynomial Inequalities via Reinforcement Learning