An Optical Control Environment for Benchmarking Reinforcement Learning Algorithms