Optimization-Based Algebraic Multigrid Coarsening Using Reinforcement Learning