AITopics | tct

As 4, number M 100) of FedA M = 1000performs faster gradient Model trained T1 2 {0,20,40,60,80,100}, whereT1 =0 Figure 5(a), we random performs margin20%intest model.

artificial intelligence, arxivpreprintarxiv, machine learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.05)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.75)

Add feedback

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

Neural Information Processing SystemsDec-25-2025, 06:11:27 GMT

State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions. For neural networks, even when centralized SGD easily finds a solution that is simultaneously performant for all clients, current federated optimization methods fail to converge to a comparable solution. We show that this performance disparity can largely be attributed to optimization challenges presented by nonconvexity. Specifically, we find that the early layers of the network do learn useful features, but the final layers fail to make use of them. That is, federated optimization applied to this non-convex problem distorts the learning of the final layers. Leveraging this observation, we propose a Train-Convexify-Train (TCT) procedure to sidestep this issue: first, learn features using off-the-shelf methods (e.g., FedAvg); then, optimize a convexified problem obtained from the network's empirical neural tangent kernel approximation. Our technique yields accuracy improvements of up to $+36\%$ on FMNIST and $+37\%$ on CIFAR10 when clients have dissimilar data.

bootstrapped neural tangent kernel, convexifying federated learning, name change, (2 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.07)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Provably expressive temporal graph networks (Supplementary material) A Further details on temporal graph networks

Neural Information Processing SystemsAug-19-2025, 02:50:12 GMT

We train all models in link prediction tasks in a self-supervised approach.

graph, mp-tgn, node, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > California > Orange County > Irvine (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Communications > Social Media (0.70)
Information Technology > Data Science > Data Mining (0.66)

Add feedback

Provably expressive temporal graph networks

Neural Information Processing SystemsAug-19-2025, 02:50:08 GMT

Temporal graph networks (TGNs) have gained prominence as models for embedding dynamic interactions, but little is known about their theoretical underpinnings.

data mining, machine learning, positional feature, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Finland (0.04)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Communications > Social Media (0.70)
Information Technology > Data Science > Data Mining (0.70)

Add feedback

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

Neural Information Processing SystemsAug-18-2025, 20:41:40 GMT

State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions.

accuracy, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

Neural Information Processing SystemsAug-18-2025, 20:41:37 GMT

State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions.

artificial intelligence, federated learning, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

Neural Information Processing SystemsMay-27-2025, 22:03:18 GMT

State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions. For neural networks, even when centralized SGD easily finds a solution that is simultaneously performant for all clients, current federated optimization methods fail to converge to a comparable solution. We show that this performance disparity can largely be attributed to optimization challenges presented by nonconvexity. Specifically, we find that the early layers of the network do learn useful features, but the final layers fail to make use of them. That is, federated optimization applied to this non-convex problem distorts the learning of the final layers.

bootstrapped neural tangent kernel, convexifying federated learning, tct

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.09)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Filters

Collaborating Authors

tct

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

d029c97ee0db162c60f2ebc9cb93387e-Supplemental-Conference.pdf

d029c97ee0db162c60f2ebc9cb93387e-Paper-Conference.pdf

c7649eeb93d2fad0ced9a3b974260710-Supplemental-Conference.pdf

c7649eeb93d2fad0ced9a3b974260710-Paper-Conference.pdf

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

Provably expressive temporal graph networks (Supplementary material) A Further details on temporal graph networks

Provably expressive temporal graph networks

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels