A Proof
A.1 Proof of Theorem 1
We leverage the results in [49
–Neural Information Processing Systems
Lemma 3. Consider the ReLU activation ... The proof of Theorem 1 is given below. Inequality 3 uses the strict monotonicity of p(·). Code is available at this link. The neural networks are updated using Adam with a learning rate initialized at 0.035 and ... All of them have no communication constraints. The training time is shown in Table 1.
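To make the optimizer setting concrete, the following is a minimal sketch of a single Adam update in plain Python, using the initial learning rate of 0.035 stated above. Everything else is an assumption made for illustration: the names `adam_step` and `minimize_quadratic` are hypothetical, the values of `beta1`, `beta2`, and `eps` are the commonly used Adam defaults rather than settings confirmed by the text, the learning rate is held constant because the schedule is not given, and the toy quadratic objective stands in for the actual network loss.

```python
import math


def adam_step(theta, grad, m, v, t, lr=0.035, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update on a list of parameters (hypothetical helper).

    lr=0.035 is the initial learning rate quoted in the text; beta1, beta2
    and eps are assumed defaults. t is the 1-based step counter.
    """
    # Exponential moving averages of the gradient and its square.
    m = [beta1 * mi + (1 - beta1) * g for mi, g in zip(m, grad)]
    v = [beta2 * vi + (1 - beta2) * g * g for vi, g in zip(v, grad)]
    # Bias correction for the zero-initialised averages.
    m_hat = [mi / (1 - beta1 ** t) for mi in m]
    v_hat = [vi / (1 - beta2 ** t) for vi in v]
    # Parameter update scaled by the corrected second-moment estimate.
    theta = [p - lr * mh / (math.sqrt(vh) + eps)
             for p, mh, vh in zip(theta, m_hat, v_hat)]
    return theta, m, v


def minimize_quadratic(target, steps=2000, lr=0.035):
    """Minimise sum((theta - target)^2) with Adam (toy stand-in objective)."""
    theta = [0.0] * len(target)
    m = [0.0] * len(target)
    v = [0.0] * len(target)
    for t in range(1, steps + 1):
        grad = [2.0 * (p - c) for p, c in zip(theta, target)]
        theta, m, v = adam_step(theta, grad, m, v, t, lr=lr)
    return theta


if __name__ == "__main__":
    print(minimize_quadratic([1.0, -2.0]))
```

On the first step the bias-corrected ratio of the two moment estimates has magnitude close to one, so each parameter moves by roughly the learning rate in the direction opposite to its gradient. This is why the initial learning rate directly sets the early step size.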
Feb-17-2026, 09:47:38 GMT