Reactor Optimization Benchmark by Reinforcement Learning