Parameterized Reinforcement Learning for Optical System Optimization