Learning robust marking policies for adaptive mesh refinement