AITopics | collegial ensemble

Collegial Ensembles

Neural Information Processing SystemsDec-24-2025, 17:32:33 GMT

Modern neural network performance typically improves as model size increases. A recent line of research on the Neural Tangent Kernel (NTK) of over-parameterized networks indicates that the improvement with size increase is a product of a better conditioned loss landscape. In this work, we investigate a form of over-parameterization achieved through ensembling, where we define collegial ensembles (CE) as the aggregation of multiple independent models with identical architectures, trained as a single model. We show that the optimization dynamics of CE simplify dramatically when the number of models in the ensemble is large, resembling the dynamics of wide models, yet scale much more favorably. We use recent theoretical results on the finite width corrections of the NTK to perform efficient architecture search in a space of finite width CE that aims to either minimize capacity, or maximize trainability under a set of constraints. The resulting ensembles can be efficiently implemented in practical architectures using group convolutions and block diagonal layers. Finally, we show how our framework can be used to analytically derive optimal group convolution modules originally found using expensive grid searches, without having to train a single model.

collegial ensemble, electronic proceedings, name change, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

d958628e70134d9e1e17499a9d815a71-Paper.pdf

Neural Information Processing SystemsNov-15-2025, 10:44:39 GMT

artificial intelligence, ensemble, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

d958628e70134d9e1e17499a9d815a71-Paper.pdf

Neural Information Processing SystemsAug-16-2025, 17:41:36 GMT

architecture, ensemble, neural network, (15 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Review for NeurIPS paper: Collegial Ensembles

Neural Information Processing SystemsFeb-6-2025, 22:47:14 GMT

This paper explores ensembles from the perspective of neural networks in the width limit. The reviewers all found that the paper is well written, technically sound and the contributions are novel and significant. There was significant discussion following the author rebuttal and multiple reviewers were willing to champion the paper for acceptance. One concern shared by reviewers was the motivation for why "var(K)" was a quantity that that should be minimized, as was presented in the theory of the paper. Overall, this seems like an exciting paper that will be of interest at the conference.

collegial ensemble, neurips paper, reviewer

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Collegial Ensembles

Neural Information Processing SystemsOct-11-2024, 12:41:00 GMT

Modern neural network performance typically improves as model size increases. A recent line of research on the Neural Tangent Kernel (NTK) of over-parameterized networks indicates that the improvement with size increase is a product of a better conditioned loss landscape. In this work, we investigate a form of over-parameterization achieved through ensembling, where we define collegial ensembles (CE) as the aggregation of multiple independent models with identical architectures, trained as a single model. We show that the optimization dynamics of CE simplify dramatically when the number of models in the ensemble is large, resembling the dynamics of wide models, yet scale much more favorably. We use recent theoretical results on the finite width corrections of the NTK to perform efficient architecture search in a space of finite width CE that aims to either minimize capacity, or maximize trainability under a set of constraints.

architecture, collegial ensemble, size increase, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.43)

Add feedback

Collegial Ensembles

Littwin, Etai, Myara, Ben, Sabah, Sima, Susskind, Joshua, Zhai, Shuangfei, Golan, Oren

arXiv.org Machine LearningJun-17-2020

Modern neural network performance typically improves as model size increases. A recent line of research on the Neural Tangent Kernel (NTK) of over-parameterized networks indicates that the improvement with size increase is a product of a better conditioned loss landscape. In this work, we investigate a form of over-parameterization achieved through ensembling, where we define collegial ensembles (CE) as the aggregation of multiple independent models with identical architectures, trained as a single model. We show that the optimization dynamics of CE simplify dramatically when the number of models in the ensemble is large, resembling the dynamics of wide models, yet scale much more favorably. We use recent theoretical results on the finite width corrections of the NTK to perform efficient architecture search in a space of finite width CE that aims to either minimize capacity, or maximize trainability under a set of constraints. The resulting ensembles can be efficiently implemented in practical architectures using group convolutions and block diagonal layers. Finally, we show how our framework can be used to analytically derive optimal group convolution modules originally found using expensive grid searches, without having to train a single model.

artificial intelligence, ensemble, machine learning, (19 more...)

arXiv.org Machine Learning

2006.07678

Genre: Research Report (0.82)

Technology: