Fast yet Safe: Early-Exiting with Risk Control
Alexander Timans, Tin Hadži Veljković

Neural Information Processing Systems

Scaling machine learning models significantly improves their performance. However, such gains come at the cost of inference being slow and resource-intensive. Early-exit neural networks (EENNs) offer a promising solution: they accelerate inference by allowing intermediate layers to 'exit' and produce a prediction early. Yet a fundamental issue with EENNs is how to determine when to exit without severely degrading performance. In other words, when is it 'safe' for an EENN to go 'fast'? To address this issue, we investigate how to adapt frameworks of risk control to EENNs. Risk control offers a distribution-free, post-hoc solution that tunes the EENN's exiting mechanism so that exits only occur when the output is of sufficient quality. We empirically validate our insights on a range of vision and language tasks, demonstrating that risk control can produce substantial computational savings, all the while preserving user-specified performance goals.
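As a concrete illustration of the calibration idea described above, the following minimal Python/NumPy sketch tunes a single exit-confidence threshold on a held-out calibration set so that the empirical risk, here taken to be disagreement between the early-exit and full-model predictions, stays below a user-specified level. The function and variable names, the Hoeffding-style finite-sample correction, and the synthetic data are illustrative assumptions, not the paper's exact procedure.

import numpy as np

def calibrate_exit_threshold(exit_confs, exit_preds, final_preds, epsilon, delta=0.1):
    # exit_confs:  (n, L) per-sample, per-exit confidence scores
    # exit_preds:  (n, L) per-sample, per-exit predicted labels
    # final_preds: (n,)   full-model (last-exit) predictions
    n, L = exit_confs.shape
    # Hoeffding-style finite-sample correction (one simple, common choice).
    margin = np.sqrt(np.log(1.0 / delta) / (2.0 * n))
    best = 1.0  # fall back to "never exit early" if nothing safer is found
    # Search thresholds from strict (exit late) to permissive (exit early).
    for lam in np.linspace(1.0, 0.0, 101):
        exits = exit_confs >= lam                                   # which exits would fire
        first_exit = np.where(exits.any(axis=1), exits.argmax(axis=1), L - 1)
        early_pred = exit_preds[np.arange(n), first_exit]
        risk = np.mean(early_pred != final_preds)                   # consistency risk
        if risk + margin <= epsilon:
            best = lam                                              # keep relaxing the threshold
        else:
            break
    return best

# Toy usage with synthetic calibration data (earlier exits err more often).
rng = np.random.default_rng(0)
n, L, C = 2000, 4, 10
confs = np.sort(rng.uniform(size=(n, L)), axis=1)                   # confidence grows with depth
final = rng.integers(0, C, size=n)                                  # full-model predictions
preds = np.tile(final[:, None], (1, L))
flip = rng.uniform(size=(n, L)) < np.array([0.3, 0.2, 0.1, 0.0])
preds[flip] = rng.integers(0, C, size=flip.sum())
lam = calibrate_exit_threshold(confs, preds, final, epsilon=0.05)
print(f"calibrated exit threshold: {lam:.2f}")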







Geometry-aware training of factorized layers in tensor Tucker format

Neural Information Processing Systems

Reducing parameter redundancies in neural network architectures is crucial for achieving feasible computational and memory requirements during training and inference. Given its easy implementation and flexibility, one promising approach is layer factorization, which reshapes weight tensors into matrix format and parameterizes them as the product of two small, low-rank matrices. However, this approach typically requires an initial full-model warm-up phase, prior knowledge of a feasible rank, and is sensitive to parameter initialization. In this work, we introduce a novel approach to train the factors of a Tucker decomposition of the weight tensors. Our training proposal is provably optimal in locally approximating the original, unfactorized dynamics, independently of the initialization. Furthermore, the rank of each mode is dynamically updated during training. We provide a theoretical analysis of the algorithm, showing convergence, approximation, and local descent guarantees. The method's performance is further illustrated through a variety of experiments, showing remarkable training compression rates and comparable or even better performance than the full baseline and alternative layer factorization strategies.
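To make the layer-factorization setting concrete, the PyTorch sketch below parameterizes a linear layer's weight as the product of two small low-rank factors under a fixed rank budget. This is only the generic factorized-layer baseline the abstract refers to; the Tucker-format parameterization, geometry-aware updates, and dynamic rank adaptation that constitute the paper's contribution are not reproduced here, and the LowRankLinear class and its initialization scales are illustrative choices.

import torch
import torch.nn as nn
import torch.nn.functional as F

class LowRankLinear(nn.Module):
    # Linear layer with weight W approximated as U @ V, where U is (out, r) and V is (r, in).
    # Parameter count drops from out*in to r*(out+in), so a small rank r compresses the layer.
    # The rank is kept fixed here; rank-adaptive / Tucker variants update it during training.
    def __init__(self, in_features, out_features, rank, bias=True):
        super().__init__()
        self.U = nn.Parameter(torch.randn(out_features, rank) / rank**0.5)
        self.V = nn.Parameter(torch.randn(rank, in_features) / in_features**0.5)
        if bias:
            self.bias = nn.Parameter(torch.zeros(out_features))
        else:
            self.register_parameter("bias", None)

    def forward(self, x):
        # (x @ V^T) @ U^T avoids ever materializing the full out-by-in weight matrix.
        return F.linear(x @ self.V.t(), self.U, self.bias)

# Swap a dense layer for a factorized one and train as usual.
dense = nn.Linear(512, 256)
factored = LowRankLinear(512, 256, rank=32)
x = torch.randn(8, 512)
print(dense(x).shape, factored(x).shape)   # both: torch.Size([8, 256])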


TransBoost: Improving the Best ImageNet Performance using Deep Transduction
Supplementary Material

Neural Information Processing Systems

Department of Computer Science, Technion - Israel Institute of Technology
omer.be@cs.technion.ac.il, guy.b@cs.technion.ac.il

In general, TransBoost is particularly useful when we are able to accumulate a test set of instances and then finetune a specialized model to predict their labels. This setting has numerous use cases in various application fields, including:

Medicine
Medical diagnosis is one possible meaningful use case. Here, medical records can be gathered on a daily or weekly basis, and TransBoost can then be used to finetune transductive models on top of existing inductive models in order to provide more reliable results for these specific records.
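The workflow described above, accumulating a batch of test instances and then fine-tuning a specialized model for them, can be sketched generically as follows. The loss used on the unlabeled test batch is plain entropy minimization, a placeholder only: TransBoost defines its own transductive objective, which is not shown here, and all names and hyperparameters below are illustrative.

import torch
import torch.nn.functional as F

def transductive_finetune(model, labeled_loader, test_inputs, epochs=3, lr=1e-4):
    # Adapt a pretrained inductive model to a specific, accumulated batch of
    # unlabeled test instances, while keeping a supervised loss on labeled data.
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for x_lab, y_lab in labeled_loader:
            sup_loss = F.cross_entropy(model(x_lab), y_lab)

            # Placeholder unsupervised term: entropy minimization on the test batch.
            probs = F.softmax(model(test_inputs), dim=-1)
            unsup_loss = -(probs * probs.clamp_min(1e-8).log()).sum(-1).mean()

            loss = sup_loss + 0.1 * unsup_loss   # weighting is an arbitrary choice here
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model

# Toy usage with a small classifier and synthetic data.
model = torch.nn.Sequential(torch.nn.Linear(20, 64), torch.nn.ReLU(), torch.nn.Linear(64, 5))
labeled = [(torch.randn(32, 20), torch.randint(0, 5, (32,))) for _ in range(10)]
test_batch = torch.randn(128, 20)
transductive_finetune(model, labeled, test_batch)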



A Appendix

Neural Information Processing Systems

A.1 Algorithms

In this section, we provide the pseudocode of the potential-dependent dropping scheme (Algorithm 1) and the overall training procedure (Algorithm 2) of SNNs with our proposed methods.

A.2 Details of Datasets and Training Settings

A.2.1 MNIST

The MNIST dataset contains 60000 images for training and 10000 for testing. Each sample in MNIST is a gray-scale handwritten digit of size 28 × 28 pixels.

A.2.2 CIFAR10

The CIFAR10 dataset is a collection of 60000 color images, divided into 50000 images for training and 10000 for testing. The images are evenly distributed across 10 labelled classes.
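For reference, the following torchvision snippet loads the two datasets with exactly the splits described above; the transform, batch size, and data directory are illustrative choices rather than the paper's training settings.

import torch
from torchvision import datasets, transforms

to_tensor = transforms.ToTensor()

# MNIST: 60000 grayscale 28x28 training digits, 10000 test digits.
mnist_train = datasets.MNIST("data", train=True, download=True, transform=to_tensor)
mnist_test = datasets.MNIST("data", train=False, download=True, transform=to_tensor)

# CIFAR10: 50000 color 32x32 training images, 10000 test images, 10 classes.
cifar_train = datasets.CIFAR10("data", train=True, download=True, transform=to_tensor)
cifar_test = datasets.CIFAR10("data", train=False, download=True, transform=to_tensor)

train_loader = torch.utils.data.DataLoader(cifar_train, batch_size=128, shuffle=True)
print(len(mnist_train), len(mnist_test), len(cifar_train), len(cifar_test))
# 60000 10000 50000 10000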