Supplementary Materials

A Proof of Theorem 2: Asymptotic Convergence of Robust Q-Learning