Feb-7-2026, 23:15:18 GMT–Neural Information Processing Systems

We next check that trick I does not improve its convergence rate either. Recall that the best known convergence rate of PG is given in (Xiong et al. ...). Now, applying trick I (minibatch sampling) to PG, we obtain the convergence rate of $\mathcal{O}\left(\frac{1}{(1-\gamma)^2 T}\right) + \mathcal{O}\left(\frac{1}{(1-\gamma)^2 B}\right)$. For NPG, it can also be checked that trick I does not improve its rate. Trick II does improve the sample complexity of NPG from $\mathcal{O}\left(\frac{1}{(1-\gamma)^8 \epsilon^4}\right)$, given in (Agarwal et al. 2019), to $\mathcal{O}\left(\frac{1}{(1-\gamma)^7 \epsilon^3}\right)$.
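
To make the size of the NPG improvement concrete, the ratio of the two sample complexities can be worked out directly. This is only a sketch: it assumes the accuracy parameter in both bounds is $\epsilon$ (the symbol is not legible in the excerpt) and ignores constants and any logarithmic factors.

```latex
% Assumption: both bounds are stated in terms of the target accuracy \epsilon.
\[
\frac{\dfrac{1}{(1-\gamma)^{8}\,\epsilon^{4}}}{\dfrac{1}{(1-\gamma)^{7}\,\epsilon^{3}}}
  \;=\; \frac{(1-\gamma)^{7}\,\epsilon^{3}}{(1-\gamma)^{8}\,\epsilon^{4}}
  \;=\; \frac{1}{(1-\gamma)\,\epsilon}
\]
```

Under that reading, the improved bound saves one factor of $1/(1-\gamma)$ and one factor of $1/\epsilon$.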

assumption 2, linear critic, stationary point

Title
Reviewer 1: Q1: I wonder if their analysis tricks of AC/NAC when applied to PG methods improve their guarantees
